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F oreword 



Many of you who will read this book will do so in the context of an engineering or 
technical background. And that’s a good thing because the more engineers and technical 
operations people understand this brilliant technology the quicker it will be adopted in 
countries that have yet to make their digital radio transition strategies. But it is equally 
important that the content providers and programmers understand what this platform can 
provide them in terms of service and commercial opportunities as we begin the 21 st 
century. . .the Digital Era. 

I’m a programmer by background and it was the programming opportunities afforded by 
Digital Audio Broadcasting (DAB, Eureka 147) that led me to believe it would be 
instrumental in maintaining Radio’s relevance in a world of digital choices. Interference 
free and CD like sound, particularly attractive in the urban mobile environment, are only 
the beginnings of the DAB advantages. It is spectrum efficient, flexible in terms of bit rate 
to program content and cheaper to operate than analog systems. 

The added bonus to DAB is that the content provider can add value for the listener with 
program and non-program associated data. Everything from simple song titles and 
performer to weather, news and traffic flashes are capable of being displayed today. Soon 
we will have integrated IT systems for the car or home capable of sophisticated navigation, 
GPS and local DAB traffic and travel information capable of interaction through GSM or 
phone return. This technology will also facilitate the introduction of full multi media 
services, again for the mobile or stationary market. What an opportunity for traditional 
radio broadcasters to bring new services and commercial ventures to their audiences. 

DAB is a relatively mature technology, over ten years old. But in the first 7 or 8 years 
most services were experimental and most receivers were prototype. But in the last couple 
of years real operation services have begun. In Europe, the UK has an aggressive plan for 
national and regional multiplexes that will cover about 85% of the population within two 
years. Today in London 35 digital services are on the air, half of which are only available 
through DAB. 

Germany likewise has an aggressive plan that will see 85% of their population covered 
within three years. Spain, Portugal, the Nordic countries, and the Benelux countries all are 
on the air or have aggressive plans to be operational in the next year or two. France is 
debating how to move from experimental services to full operational activity. And receivers 
for the car, home and soon hand-held mobile devices are and will be increasingly available 
in the marketplace. As this book is published, over 16 receiver models and 3 DAB PC cards 
are available and more are coming to market. Prices were high and still are at premium 
levels but they are steadily falling to affordable prices. 

DAB is important in the context of Europe. Since the technology was developed in 
Europe, the rest of the world is looking to Europe for successful transition strategies and 
DAB adoption. But from the beginning countries like Canada (currently operational in 4 
major cities) and Australia (planning operational services 2001/02) found the technology 
innovative and right for their markets. Singapore began operations two years ago, Taiwan 
China and Hong Kong all have experimental multiplexes. Brunei, Malaysia, Korea and 
India are all considering experimental services in the near future. Many other countries are 
assessing the technology and making DAB plans based on their cultural and market needs. 
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I think those of us who have been part of DAB for almost the last decade were surprised 
that there wasn’t instant takeoff and overnight success for DAB services. We were probably 
naive, because this is a replacement technology for analog radio which is ubiquitous around 
the world and the magnitude of change required is enormous. We began to understand this 
in the mid nineties and formed the World DAB Forum late in 1996 to facilitate the 
transition on a Global basis. It is a remarkable group of industry stakeholders including 
broadcasters, network providers, regulators, consumer electronic manufacturers, electronic 
business and internet related companies and research institutes. 

They have a common purpose: to seek solutions to spectrum and regulatory issues, 
service availability, commercial technical applications, receiver diversity and availability, 
and the best practices for market transition. We share best practices nationally, regionally 
and globally and our agenda changes to meet the current needs and issues of the day. Our 
goal is to facilitate an orderly transition from analog to DAB radio, ensuring the future 
relevance, cultural and economic well being of radio. 

It is with this backdrop to the potential of DAB and the current operational 
commitments to DAB services around the world that I commend this technical review of 
the DAB system in the hope that more knowledge about this wonderful system will 
generate more services and more countries making operational commitments. 



Michael McEwen, London, Autumn 2000 

President, World DAB Forum 




Preface 



The new digital radio system DAB (Digital Audio Broadcasting), developed within the 
Eureka 147 project in close co-operation with the EBU, is a very innovative and universal 
multimedia broadcast system which is just being introduced and has the potential to replace 
the existing AM and FM audio broadcast services in many parts of the world in the near 
future. In several countries in Europe and overseas, broadcasting organisations, network 
providers and receiver manufacturers are already implementing digital broadcasting 
services using the DAB system. 

DAB is very different from conventional analogue broadcasting systems. Most of the 
system components such as perceptual audio coding (MPEG- 1/2), OFDM channel coding 
and modulation, the provision of a multiplex of several services and data transmission 
protocols (MOT) are new concepts typical of digital broadcasting. Even experts in analogue 
transmission systems will feel less familiar with these new elements of broadcasting 
technology. Therefore, the aim of this book is to inform the expert reader about the basic 
concepts of the DAB system 

Besides introducing the basics, the focus of the book is on the practical implications of 
service provision and the new infrastructure required in broadcasting houses, for multiplex 
and network management, and for coverage planning. Also some elements of up-to-date 
receiver concepts are described. 

The level of standardisation of the DAB system is rather advanced and the relevant 
recent international standards and related documents are introduced and referred to for easy 
access for the reader seeking technical details. An extended bibliography is also provided. 

The book is designed as a well-structured technical guide by a team of expert authors 
closely involved in the development and standardisation of DAB. This ensures competent 
presentation and interpretation of the facts based on the latest state of the art. The book is 
primarily aimed at professional users such as developers and manufacturers of professional 
devices for distribution networks or consumer receivers, planning engineers and operational 
staff with broadcasters, network providers, service and content providers. For other 
technically minded people who wish to become acquainted with the concepts of digital 
broadcasting, the book will serve as a comprehensive introduction to the field since it 
contains all the information needed for further study. 

The book may also serve for academic or educational use because it is based on the 
latest versions of the relevant international standards and publications, as well as actual 
experience with pilot applications and first implementation of services. 

The editors wish to take this opportunity to express their thanks to all the contributors 
for the enjoyable co-operation and their excellent work, which most of them had to 
complete in addition to their demanding jobs. Many thanks also to Mrs Helga Schon who 
was kind enough to design a portion of the drawings for the book. 

The editors also wish to thank the publishing team at John Wiley & Sons for their 
interest, understanding and patience during the writing and production period. 

May this book help to introduce the DAB system world-wide. 



The Editors 



Berlin/Numberg, Autumn 2000 
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1.1 General 

The new digital radio system DAB (Digital Audio Broadcasting) is a very innovative and 
universal multimedia broadcast system which will replace the existing AM and FM audio 
broadcast services in many parts of the world in the future. It was developed in the 1990s 
by the Eureka 147/DAB project. DAB is very well suited for mobile reception and provides 
very high robustness against multipath reception. It allows use of single frequency networks 
(SFNs) for high frequency efficiency. 

Besides high-quality digital audio services (mono, two-channel or multichannel 
stereophonic), DAB is able to transmit programme-associated data and a multiplex of other 
data services (e.g. travel and traffic information, still and moving pictures, etc.). A dynamic 
multiplex management on the network side opens up possibilities for flexible programming. 

In several countries in Europe and overseas broadcast organisations, network providers 
and receiver manufacturers are going to implement digital broadcasting services using the 
DAB system in pilot projects and public services. 

DAB works very differently from conventional broadcasting systems. Most of the 
system components such as perceptual audio coding, channel coding and modulation, 
multiplex management or data transmission protocols are new solutions and typically not so 
familiar to the expert in existing analogue or digital broadcast systems. 

The level of standardisation of the DAB system is rather advanced and the various 
recent international standards and related documents are introduced and referred to for easy 
access for the reader seeking technical details. 
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1.2 Radio in the Digital Age 

Radio broadcasting is one of the most widespread electronic mass media comprising 
hundreds of programme providers, thousands of HF transmitters and billions of receivers 
world-wide. Since the beginning of broadcasting in the early 1920s, the market has been 
widely covered by the AM and FM audio broadcasting services. 

Today we live in a world of digital communication systems and services. Essential parts 
of the production processes in radio houses were changed to digital ones in recent times, 
beginning with the change from conventional analogue audio tape to digital recording on 
tape, hard disk or CD, via digital signal processing in mixing desks and digital transmission 
links in distribution processes. The first steps in the introduction of digital broadcasting 
services were taken by the systems NICAM 728 (Near Instantaneously Companded Audio 
Multiplex, developed by the BBC for stereo television sound in the VHF/UHF bands), DSR 
(Digital Satellite Radio), or ADR (Astra Digital Radio, see section 1.6.2), but they were not 
suited to replace the existing conventional services completely, especially for mobile 
reception. 

Normally, it takes a period of a human generation (or at least a period in the life of a 
receiver type generation, i.e. approximately 10 years) to replace an existing broadcasting 
system by a new one. Therefore, strong reasons and very convincing advantages are 
required to justify the introduction of a new broadcasting system. 



1.3 Benefits of the Eureka 147 DAB System 

However, there will always be some problems, or additional effort will be needed when 
replacing an existing technology by a new one, such as 

• lack of transmission frequencies 

• costs for development and investment 

• looking for providers for new non-conventional services (e.g. data services) 

• solving the chicken and egg problem (who will be first - the service provider or the 
receiver manufacturer?). 

Nevertheless, the new Eureka 147 DAB system provides a wealth of advantages over 
conventional audio broadcast systems such as analogue VHF/FM or AM radio, and also 
partly over other existing digital broadcast systems such as DSR, ADR, etc. The following 
list will only highlight some key advantages as an overview; many more details will be 
explained in the corresponding sections of the book. 

Quality of service 

DAB uses all the possibilities of modem digital communication technologies and can thus 
provide a much higher level of quality of service, such as 

• Superior sound quality : DAB users can enjoy pure undistorted sound close to CD 
quality. New features such as Dynamic Range Control (DRC) or Music/Speech Control 
can be used individually by customers to match the audio quality to their needs. 

• Usability: Rather than searching wavebands, users can select all available stations or 
preferred formats from a simple text menu. 
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• Perfect reception conditions : With just a simple, non-directional whip antenna, DAB 
eliminates interference and the problem of multipath while in a car. It covers wide 
geographical areas with an even, uninterrupted signal. Once full services are up and 
running, a driver will be able to cross an entire country and stay tuned to the same 
station with no signal fade and without altering frequency. 



Wide range of value-added services 

DAB is quite unique in that both music and data services can be received using the same 

receiver. One receiver does it all, such as 

• Typical audio broadcasting (main service): Music, drama, news, information, etc., can 
be received in monophonic or stereophonic form as is well known from conventional 
radio programmes; there is also the potential to transmit multichannel (5.1 format) 
audio programmes as well. 

• Programme-associated data (PAD): DAB broadcast receivers can display text 
information in far greater detail than RDS, such as programme background facts, a 
menu of future broadcasts and complementary advertising information. Receivers 
attached to a small screen will display visual information such as weather maps or CD 
cover images. 

• Information services: Services from sources other than the broadcasting station are 
included within the same channel for the user to access at will. These include news 
headlines, detailed weather information or even the latest stock marked prices. 

• Targeted music or data services: Because digital technology can carry a massive 
amount of information, specific user groups can be targeted with great accuracy 
because each receiver can be addressable. 

• Still or moving pictures: Data can also appear as still or moving photographic pictures, 
accompanied by an audio service or as separate information. 



Universal system layout 

The DAB system has a fairly universal and well-standardised system layout which allows 

applications for all known transmission media and receiving situations. 

• Standardisation: The level of international standardisation of all basic principles and 
transmission tools for the new DAB system is very high (more than 40 international 
standards cover all necessary details). 

• Unique system design: DAB services will be available manly on terrestrial but also 
suited for cable and satellite networks, and the same receiver could be used to provide 
radio programmes and/or data services for national, regional, local and international 
coverage. 

• Wide choice of receivers: It is possible to access DAB services on a wide range of 
receiving equipment including fixed (stationary), mobile and portable radio receivers, 
optionally completed with displays or screens, and even personal computers. 



Flexibility of multiplex configuration 

DAB services are transmitted in a flexible multiplex configuration, which can be easily 
changed instantaneously to the actual needs of the content providers. 
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• Multiplex configuration: The arrangement of services in a DAB multiplex may be 
changed instantaneously to match the needs of the providers of programmes or data 
services, without interrupting ongoing services. 

• Bit rate flexibility: The programme provider can choose an appropriate bit rate for a 
certain audio programme according to its quality, for instance less than 100 kbit/s for a 
pure speech programme, 128 kbit/s for monophonic or 256 kbit/s for stereophonic 
music; also half sampling frequency can be used for lower quality services. So the 
available bit rate can be split optimally between different services of a DAB ensemble. 



Transmission efficiency 

Compared to conventional broadcast systems much less economic effort in investment and 
operation is needed for a DAB transmission system. 

• Lower transmission costs for broadcasters: DAB allows broadcasters to provide a 
wide range of programme material simultaneously on the same frequency. This not 
only makes room for a vastly increased number of programmes to increase user choice, 
but also has important broadcast cost-cutting implications. 

• Lower transmission costs for transmitter network providers: For digital transmission a 
DAB transmitter needs only a fraction of the electrical energy compared to a 
conventional AM or FM transmitter. 

• Frequency efficiency: DAB transmitter networks can be designed as Single Frequency 
Network (SFNs), which saves a lot of transmission frequencies and thus transmission 
capacity on air. 

These advantages of DAB (and there are more if we look further into the details) justify the 
introduction of DAB into the media world in order to replace the existing conventional 
radio systems step by step over a longer period. 



1.4 History of the Origins of DAB 

1.4.1 Steps of Development 

In the early 1980s the first digital sound broadcasting systems providing CD like audio 
quality were developed for satellite delivery. These systems made use of the broadcasting 
bands in the 10 to 12 GHz region, employed very little sound data compression and were 
not aimed at mobile reception. Thus, it was not possible to serve a great majority of 
listeners, such as those travelling in cars. Also, another feature of the well-established FM 
radio could not be provided by satellite delivery, namely “local services”. Consequently 
terrestrial digital sound broadcasting was considered as an essential delivery method for 
reaching all listeners. 

At first investigations were initiated by radio research institutes looking into the 
feasibility of applying digital modulation schemes in the FM bands. However, the 
straightforward use of pulse code modulation (PCM) in the upper portions of the FM band 
generated intolerable interference in most existing FM receivers and was spectrally very 
inefficient. Mobile reception was never tried and would not have succeeded. A much more 
sophisticated approach was definitely necessary. 
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In Germany the Federal Ministry for Research and Technology (BMFT, now BMBF) 
launched a research initiative to assess the feasibility of terrestrial digital sound 
broadcasting comprising more effective methods of sound data compression and efficient 
use of the radio spectrum. A study completed in 1984 indicated that promising results could 
be expected from highly demanding research activities. As a new digital sound 
broadcasting system could only be implemented successfully by wide international 
agreement, BMFT set the task for its Project Management Agency at DLR (German 
Aerospace Centre) to form a European consortium of industry, broadcasters, network 
providers, research centres and academia for the development of a new digital audio 
broadcasting system. Towards the end of 1986 a consortium of 19 organisations from 
France, Germany, The Netherlands and the United Kingdom had signed a co-operation 
agreement and applied for notification as a Eureka project. At the meeting in December 
1986 of the High Level Representatives of the Eureka partner states in Stockholm the 
project, now called “Digital Audio Broadcasting, DAB”, was notified as the Eureka 147 
project. National research grants were awarded to that project in France, Germany and The 
Netherlands. However, owing to granting procedures official work on the project could not 
start before the beginning of 1988 and was supposed to run for four years. 

Credit must also be given to the European Broadcasting Union (EBU), which had 
launched work on the satellite delivery of digital sound broadcasting to mobiles in the 
frequency range between 1 and 3 GHz, by awarding a research contract to the Centre 
Commun d’Etudes de Telediffusion et Telecommunications (CCETT) in Rennes, France, 
prior to the forming of the DAB consortium. As the CCETT also joined the DAB project, 
the work already begun for the EBU became part of the DAB activities and the EBU a close 
ally and active promoter for DAB. Later, this proved very important and helpful in relations 
with the International Telecommunications Union (ITU-R) and the standardisation process 
with the European Telecommunications Standards Institute (ETSI). 

From the beginning the goals set for the project were very demanding and difficult to 
achieve. Perfect mobile reception was the overall aim. In detail the list of requirements to 
be met included the following items: 

• audio quality comparable to that of the CD 

• unimpaired mobile reception in a car, even at high speeds 

• efficient frequency spectrum utilisation 

• transmission capacity for ancillary data 

• low transmitting power 

• terrestrial, cable and satellite delivery options 

• easy-to-operate receivers 

• European or better world-wide standardisation. 

The first system approach considered at least 16 stereo programmes of CD audio quality 
plus ancillary data to be transmitted in the 7 MHz bandwidth of a television channel. This 
definitely cannot be achieved by simply transmitting the combined net bit rates of 16 CD- 
like programme channels, which are around 1,4 Mbit/s each, over the TV channel. So a 
high degree of audio data compression without any perceptible loss of audio quality was 
mandatory. Data rates below 200 kbit/s per stereo channel had to be achieved. 

Unimpaired mobile reception was also required to overcome the adverse effects of 
multipath signal propagation with the associated frequency selective fading. 
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Audio data compression and the transmission method became the central efforts of the 
research project. Both tasks were addressed in a broad and comprehensive manner. For 
audio coding four different approaches were investigated: two sub-band coding systems 
competed with two transform coding systems. Similarly, for the transmission method four 
different schemes were proposed: 

• one narrow-band system 

• one single carrier spread- spectrum system 

• one multicarrier OFDM system 

• and one frequency-hopping system. 

All approaches were developed to an extent where - either through experimental evidence 
or at least by thorough simulation - a fair and valid comparison of the performance of the 
proposed solutions became possible. The period of selection of and decision for the best 
suited audio coding system and the most appropriate transmission scheme was a crucial 
moment in the history of the Eureka 147 consortium. For audio coding the greatest part of 
the selection process happened external to the consortium. All four coding schemes 
previously had been within the activities of the ISO/IEC Moving Pictures Experts Group 
(MPEG), which worked on standardisation for data compressed video and audio coding. 
There the solutions offered by Eureka 147 competed against 10 other entries from other 
countries, world-wide. The MPEG Audio Group set up a very elaborate and qualified audio 
quality assessment campaign that was strongly supported by Swedish Radio, the British 
Broadcasting Corporation and the Communications Research Centre, Canada, among 
others. The very thorough subjective audio quality tests revealed that the audio coding 
systems submitted by Eureka 147 showed superior performance and consequently were 
standardised by ISO/IEC as MPEG Audio Layers I, II and III. Within the Eureka 147 
consortium, after long consideration Layer II, also known as MUSICAM, was selected for 
the DAB specification. 

The process of choosing the most appropriate transmission method took place within 
the Eureka 147 consortium alone. In simulations performed according to rules worked out 
by the members the four approaches were put to the test. This showed that the broadband 
solutions performed better than the narrow-band proposal. Among the broadband versions 
the spread-spectrum approach had a slight advantage over the OFDM approach, while the 
frequency-hopping solution was considered too demanding with respect to network 
organisation. However, the OFDM system was the only one that was already available in 
hardware with field-test experience - in the form of the coded Orthogonal Frequency 
Division Multiplex (COFDM) system, while the spread-spectrum proposal by then was not 
developed as hardware at all and estimated to be very complex. So, the choice fell on 
COFDM, which has since proven to be an excellent performer. 

A further important decision had to be made relating to the bandwidth of the DAB 
system. From a network and service area planning point of view as well as obtainable 
frequency spectrum perspectives, an ensemble of 16 programmes on one transmitter with 
the 7 MHz bandwidth of a TV channel proved to be much too inflexible, although in 
experiments it had provided very good performance in a multipath environment. Therefore, 
a considerable but reasonable reduction in transmission bandwidth was necessary. In 
Canada experiments with the COFDM system revealed that substantial performance 
degradation begins around 1.3 MHz and lower. So, a reasonable bandwidth for a DAB 
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channel or “DAB block” was defined as 1.5 MHz. This allows several possibilities, as 
follows. 

A 7 MHz TV channel can be divided into four DAB blocks, each carrying ensembles of 
five to seven programmes. With four blocks fitting into 7 MHz service area planning is 
possible with only one TV channel, without having adjacent areas using the same DAB 
block. Furthermore, 1.5 MHz bandwidth is sufficient to transport one MPEG coded 
audio/video bit stream. 

After the above-mentioned important decisions had been made the members of the 
consortium all turned their efforts from their individual approaches to the commonly 
defined system architecture and with rapid progress developed the details of the complete 
basic DAB specification to be submitted to the international standardisation bodies. By that 
time, another European research project, the JESSI flagship project AE-14, was eagerly 
awaiting the DAB specification to begin the development of chip-sets for DAB. Also, the 
standardisation bodies like ETSI were well aware and waiting for the submission of the 
specification since Eureka 147 members had been very active in testing and presenting the 
results of their research and development on many important occasions, together with or 
organised by the EBU. 

The first official presentation of DAB took place at the World Administrative Radio 
Conference 1988 (WARC'88) in Geneva for the delegates of this conference. As the issue 
of a frequency allocation for digital sound broadcasting in the L-band around 1.5 GHz was 
up for decision, a demonstration simulating satellite reception was presented. A transmitter 
on Mont Saleve radiated the DAB signal in the UHF TV band, giving in downtown Geneva 
an angle of signal incidence similar to that from a geostationary satellite. Mobile reception 
was perfect and the delegates to the conference were highly impressed. In consequence the 
conference assigned spectrum in the L-band for satellite sound broadcasting with terrestrial 
augmentation permitted. 

One year later DAB was presented at the ITU-COM exhibition in Geneva. In 1990 tests 
with mobile demonstrations were run by Canadian broadcasters in Toronto, Ottawa, 
Montreal and Vancouver. DAB demonstrations and exhibits were shown at all International 
Radio Shows (IFA) in Berlin and the UK Radio Festivals in Birmingham since 1991. Four 
International DAB Symposia have been held up to now: 1992 in Montreux, 1994 in 
Toronto, 1996 again in Montreux and 1999 in Singapore. In 1994 a mobile DAB 
demonstration was presented at the Arab States Broadcasting Union Conference in Tunis. 
From all these activities world-wide recognition and appreciation was gained for DAB. 

The efforts and results of finding acceptance for DAB in the United States deserve an 
extra paragraph. As early as 1990 - by invitation of the National Association of 
Broadcasters (NAB) - the consortium presented DAB at low key at the NAB Convention in 
Atlanta, Georgia. This led to a very elaborate mobile demonstration and exhibition at the 
next NAB Convention in 1991 in Las Vegas, Nevada. Several of the NAB officials by that 
time were very interested in reaching a co-operation and licence agreement with the 
Eureka 147 consortium. However, strong opposition against that system - requiring new 
spectrum and the bundling of several programmes onto one transmitter - also arose. The 
US radio industry - that is, the broadcasters - feared new competition from the licensing of 
new spectrum. They preferred the idea of a system approach named “In-Band-On-Channel 
(IBOC)”, where the digital presentation of their analogue programmes is transmitted 
together and within the spectrum mask of their licensed FM channel (see also section 1.6). 
This of course would avoid the need of new licensing for digital broadcasting and thus keep 
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new competition away. However appealing and spectrum efficient this concept may be, the 
realisation might prove to be a very formidable technical task. No feasible development 
was available at that time, but fast development of an IBOC system was promised. 
Eureka 147 DAB performed flawlessly at Las Vegas, but those opposing the system 
claimed that the topography around Las Vegas was much too favourable to put DAB to a 
real test. So it was requested that DAB should next come to the 1991 NAB Radio Show in 
San Francisco to be tested in a very difficult propagation environment. One main 
transmitter and one gap filler were set up and mobile reception in downtown San Francisco 
was impressively demonstrated. An announced demonstration of IBOC broadcasting did 
not take place as the equipment was not ready. In spite of the good results the opposition to 
DAB gained momentum and the NAB officially announced its preference for an IBOC 
system. This was not in line with the intentions of the Consumer Electronics Manufacturers 
Association (CEMA) of the Electronics Industry Association (EIA) which came to an 
agreement with the NAB to run very strictly monitored laboratory tests in Cleveland and 
field trials in San Francisco comparing the Eureka 147 DAB with several US IBOC and 
IBAC (In-Band- Adjacent Channel) systems. The results were to be presented to the Federal 
Communications Commission (FCC) for rule-making relating to digital sound 
broadcasting. While Eureka 147 soon had equipment ready for the tests, the US proponents 
for a long time could not provide equipment and delayed the tests until 1995. In the 
laboratory tests DAB outperformed all competing systems by far. Claiming to have been 
unfairly treated in the evaluation of the laboratory tests, all but one of the US proponents of 
terrestrial systems withdrew from the field tests in San Francisco. For Eureka 147 the 
Canadian partner Digital Radio Research Inc. (DRRI) installed on contract a single 
frequency network of one main transmitter and two gap fillers to provide coverage for the 
area designated for mobile testing. Again DAB provided excellent performance as was 
documented in the final reports of the CEMA. Nevertheless, the United States still pursued 
the concept of IBOC although several generations of IBOC equipment and redesigns have 
only produced marginal performance. Even though numerous Americans now admit that 
DAB is a superior system they claim that it is not suited for the US broadcasters. The FCC 
has not made any decision on terrestrial digital audio broadcasting up to now. 

Standardisation in Europe and world-wide occurred at a better pace. The first DAB- 
related standard was achieved for audio coding in 1993, when the International 
Organisation for Standardisation/Intemational Electrical Commission (ISO/IEC) released 
the International Standard (IS 11172-3) comprising MPEG/Audio Layers I, II and III 
(IS 1 1 172). Also in 1993 ETSI adopted the basic DAB standard ETS 300 401, with several 
additional standards following later. The ITU-R in 1994 issued Recommendations 
(BS. 1114) and (BO. 11 30) relating to satellite and terrestrial digital audio broadcasting, 
recommending the use of Eureka 147 DAB mentioned as “Digital System A”. Consumer 
equipment manufacturers have also achieved several standards concerning the basic 
requirements for DAB receivers issued by the Comite Europeen de Normalisation 
Electrotechnique (CENELEC) (for more detailed information see section 1.5). 

Even though the technology, the norms and standards had been developed, the most 
critical issue was still not resolved : provision of frequency spectrum for DAB. W ARC '88 
had allocated 40 MHz of spectrum in the L-band to satellite sound broadcasting, allowing 
also terrestrial augmentation. Through the intense intervention of several national 
delegations W ARC '92 conceded primary terrestrial use of a portion of that allocated 
spectrum, which for several countries is the only frequency resource available. The L-band 
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- very well suited for satellite delivery of DAB - on the other hand becomes very costly for 
terrestrial network implementation. VHF and UHF are much more cost efficient than the 
terrestrial L-band. There was no hope of acquiring any additional spectrum below 1 GFIz 
outside of the bands already allocated to broadcasting. So, in 1995 the Conference 
Europeenne des Administrations des Postes et des Telecommunications (CEPT) convened a 
spectrum planning conference for terrestrial DAB in Wiesbaden, Germany, that worked out 
an allotment plan for DAB in VHF Band III (mostly former TV channel 12) and in the L- 
band from 1452 to 1467 MHz, allowing for all CEPT member states two coverages of 
DAB, one in the VHF range, the other in the L-band. This decision made possible the 
installation of experimental DAB pilot services in many countries of Europe and the 
beginning of regular services starting in 1997 with Sweden and the United Kingdom. More 
and more countries - also outside of Europe - are following suit. It is recognised that 
presently the available spectrum is not sufficient to move all existing FM and future 
programmes to DAB. 



L4.2 Organisations and Platforms 

A few of the organisations and bodies were or still are busy promoting and supporting the 
development and introduction of the DAB system world-wide: 

Eureka 147 consortium 

As mentioned above the Eureka 147 consortium was the driving force in the development 
of DAB. It formed a Programme Board, dealing with strategic planning, contractual and 
legal affairs, membership and promotion, a Steering Committee, planning the tasks of the 
working groups and making all technical decisions, and four working groups of varying 
task assignments. DLR in Cologne, Germany, was chosen to act as the managing agency 
and as the project office for Eureka affairs. 

While the original consortium did not accept new members at the beginning, this policy 
was changed when the basic specification of DAB (EN 300401) was ready to be released to 
the public for standardisation in 1992. The Eureka 147 Programme Board established rules 
for the entry of new partners into the consortium, requiring, for example, an entrance fee of 
DM 150,000 from industrial companies while entry for broadcasters and network operators 
was free of charge. The money earned from the entrance fees was mainly reserved for 
system promotion and to a smaller extent for organisational expenses. In 1993 the first new 
members were admitted and soon the consortium grew to 54 members from 14 countries. 

The research work of the original partners had led to a number of basic patents for 
DAB, individually held by several members. Consequently the consortium came to the 
conclusion to offer licences as a package to all necessary intellectual property rights (IPR) 
through authorised agents, one for receiver matters and another for transmitter and 
measuring equipment. 

In 1997 the idea of merging Eureka 147 with the promoting organisation WorldDAB 
(see below) was discussed with the result that a gradual merger was adopted and the final 
merger completed by the beginning of the year 2000. The activities of the EU-147 Project 
Office were transferred to the WorldDAB Project Office by the end of 1998. With the 
complete merger at the end of 1999 Eureka 147 ceased to exist. The members of the 
consortium can now co-operate in WorldDAB Module A, where technical issues of DAB 
are handled. 
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National DAB platforms 

The first national DAB platform DAB Plattform e. V. was initiated by the German Ministry 
for Education and Research in 1991. Founded as a national platform it soon accepted 
members from Austria and Switzerland and consequently dropped the word “national” from 
its name. It soon reached a membership of 52 organisations. The main objective of this 
platform was the promotion of the DAB system in German-speaking countries. The 
organisation of and activities for DAB presentations at the IFA events in Berlin were 
highlights in the promotion programme of the German platform. The members decided to 
end the existence of DAB Plattform e.V. by autumn 1998 after considering it to have 
achieved its objectives. 

Likewise, national platforms were established in several other European countries. 
France launched the Club DAB France , The Netherlands the Dutch DAB Foundation and 
the United Kingdom the UK DAB Forum. Promotional activities for the public and 
information to governmental agencies were again the objectives. At the time of writing 
these organisations continue to exist. 

EuroDAB/WorldDAB Forum 

Strongly stimulated and supported by the EBU, an organised co-operation of national 
platforms and other interested bodies lead to the founding of the EuroDAB Forum in 1995. 
A EuroDAB Project Office was set up at EBU Eleadquarters in Geneva. Membership rose 
quickly, bringing together interested organisations not only from Europe but from many 
parts of the world. Accordingly the General Assembly of EuroDAB decided to change its 
name to WorldDAB Forum in 1997. WorldDAB has around 100 members from 25 
countries world-wide. Promotion of DAB is again the main objective. Formerly EuroDAB 
and now WorldDAB issue a quarterly DAB Newsletter . The Forum has organised all but the 
first International DAB Symposia mentioned above. Technical, legal and regulatory as well 
as promotional issues are dealt with in several modules of the Forum. Since January 2000 
the Eureka 147 consortium has fully merged with the WorldDAB Forum. The WorldDAB 
Project Office moved from Geneva to London in 1998 and has also acted for Eureka 147 
since 1999. An extensive web-site of WorldDAB and DAB information is maintained at 
this office (www.worlddab.org). 



1.4.3 Milestones of Introduction 

At the time of writing (mid 2000) DAB remains - after more than 10 years’ development 
and standardisation - still in the phase of early introduction. However, in some regions 
programme and service providers have already started regular DAB services. Although 
DAB was primarily developed in the Eureka 147 project as a European broadcast system, it 
can be shown already in this phase that most of the industrial developed regions world-wide 
(i.e. Asia, Australia, and parts of America such as Canada and South America) are now 
interested in introducing this highly sophisticated universal broadcast system, too (see 
Appendix 2). The world-wide penetration of DAB is mainly supported by the WorldDAB 
Forum (see above). The actual status of introduction can be found at the web-site also given 
above. 

Only in the important market places of United States and Japan were being developed 
proprietary digital radio systems, see section 1.6. 
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1.5 International Standardisation 

The new DAB system shows a very high level of standardisation in its principles and 
applications, which is rather unusual for a new broadcasting or multimedia system. (There 
are more than 40 corresponding international standards and related documents.) 

After the initial development by some broadcasters and related research institutes, the 
first international standards were passed by the ITU-R (International Telecommunications 
Union, Radiocommunications Sector). In the period following the system has been 
developed within the Eureka 147 project, supported by a wide representation of 
telecommunication network providers, broadcasters, receiver manufacturers and related 
research institutes, in close co-operation with the EBU. 

In the following some basic standards of the DAB system are listed out. A more 
complete listing of the corresponding standards and related documents can be found in the 
Bibliography, which is referred to within various parts of the text. 

Basic requirements and system standards 

The basic ITU-R Recommendation (BS.774) shortly specifies as “ Digital system A ” the 
main requirements to the new broadcasting system DAB. Other ITU-R Recommendations 
(BS.789) and (BO. 11 30) regulate conditions needed for additional frequency ranges for 
emission. 

Several ITU World Administrative Conferences (WARC’79, WARC’85, WARC’88 
WARC’92) dealt with digital radio and allocated frequency bands for satellite and 
terrestrial digital sound broadcasting. More details were decided in (CEPT, 1995). 

As a result of developments within the Eureka 147 project, the Main DAB Standard or 
DAB Specification (EN 300401) was approved by ETSI European Telecommunications 
Standards Institute, which defines the characteristics of the DAB transmission signal, 
including audio coding, data services, signal and service multiplexing, channel coding and 
modulation. (The document was formerly often called the ETS; now it is a European 
Standard EN.). 

Audio coding 

DAB represents one of the most important applications of the generic ISO/MPEG- 1 
Layer II audio coding scheme (IS 11172). The use of this ISO/IEC coding standard is 
recommended by the ITU-R in (BS.1115), and certainly in the DAB Specification 
(EN 300401). DAB is also designed to transmit MPEG-2 Layer II audio (IS 13818), for 
instance for lower quality half-sample-rate transmission or multichannel audio 
programmes, see also (ES 201755). Other standards specify procedures and test bit-streams 
for DAB audio conformance testing (TS 101757), or audio interfaces for transmission 
within the studio region (IEC 60958), (IEC 61937). 

Data services 

Owing to the special needs for data services in DAB a new standard for Multimedia Object 
Transfer (MOT) was created defining the specific transport encoding of data types not 
specified in (EN 300401) and ensuring interoperability between different data services and 
application types or equipment of different providers or manufacturers. Additional 
guidelines are given in the “MOT Rules of operation” (TS 101497), “Broadcast Web-site 
application” (TS 101498) or “Slide show application” (TS 101499). An important bridge 
between the well-known radio data service RDS for FM (EN 50067) and the data services 
in DAB is provided by the European Norm (EN 301700) which defines service referencing 
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from FM-RDS and the use of RDS-ODA (open data applications). A transparent data 
channel for DAB transmission is described in (TS 101759). 

Network and transmission standards 

Based on the DAB main standard (EN 300401) additional standards are given to define the 
DAB multiplex signal formats for distribution (networking) and transmission. This is the 
so-called Service Transport Interface (STI) (EN 300797) for contribution networks between 
service providers and broadcast studios, the Ensemble Transport Interface (ETI) 
(EN 300799), and the Digital baseband In-phase and Quadrature (DIQ) interface 
(EN 300798) for DAB channel coding using OFDM modulation. 

To provide interactive services, transport mechanisms for IP Datagram Tunnelling 
(ES 201735), Network Independent Protocols (ES 201736) and Interaction Channel 
Through GSM/PSTN/ISDN/DECT (ES 201737) are defined. 

Receiver requirements 

Based on the DAB main standard (EN 300401) additional standards are given to define the 
implementation of DAB receivers. (EN 50248) describes DAB receiver characteristics for 
consumer equipment for terrestrial, cable and satellite reception. EMC parameters for 
receivers are identified in EACEM TR-004. (EN 50255) specifies the Radio Data Interface 
(RDI) between DAB receivers and peripheral data equipment. A special command set to 
control receivers is described in (EN 50320). (TR 101758) lists general field strength 
considerations for a DAB system. 

Guidelines for implementation and operation 

Based on the main standards, more detailed guidelines and rules for implementation and 
operation are compiled in (TR 101296) as a main guideline document for service providers 
and manufacturers. A guide to standards, guidelines and bibliography is given in the ETSI 
Technical Report (TR 101495). A broadcaster’s introduction to the implementation of some 
key DAB system features (BPN 007) is provided by the EBU. 



1.6 Relations to Other Digital Broadcasting Systems 

In addition to the European DAB system which is mainly covered by this book, several 
other digital sound broadcasting services exist which have been or are being developed and 
(partly) introduced. 

These systems differ in many aspects (specifications, service complexity, parameters) 
from the DAB concept. Some are focused more strongly on a single application or service 
(e.g. stationary reception) or provide lower audio quality levels (such as WorldSpace or 
DRM). Except for ISDB-T, which uses technologies very similar to DAB, all other systems 
are not expected to be able to comply with the audio quality and quality of service in 
mobile reception provided by DAB. Nevertheless the basic concepts and limitations of 
some of these systems will be introduced briefly. 

Also, there are a few cable-based digital radio services, which are mainly derived from 
existing satellite or terrestrial radio systems (for instance, ADR, DVB, etc.) so it may not be 
necessary to describe them separately. In general, those systems use QAM schemes in order 
to achieve high data rates in the limited bandwidth of the cable. This is possible because of 
the high signal-to-noise ratio available. However, these services have only local 
importance, depending of the extension of the broadband cable distribution network used. 
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1.6.1 Satellite-based Digital Radio Systems 

Although the DAB system can also be used for satellite transmissions, a number of 
different proprietary systems have been designed by companies providing direct 
broadcasting satellite services. Of course, many radio programmes are transmitted via 
telecommunication satellites to feed relay transmitters or cable head stations. These systems 
are beyond the scope of this section. 

Generally, the system layout in the direct broadcasting systems focuses either on 
stationary reception with highly directive antennas (e.g. “dishes”) in the 1 1 GHz range (e.g. 
ADR) or on portable and possibly mobile reception in the L-band (1.5 GHz) and S-band 
(2. 3/2. 5/2.6 GHz) allocations (e.g. WorldSpace, XM and Sirius). 

Historically, the first system for the direct satellite broadcasting of digital radio services 
was Digital Satellite Radio (DSR). In contrast to later systems, no sound compression was 
used but only a slight reduction in data rate as compared to CD by using a scale factor. This 
service has recently been closed down because of the end of the lifetime of the satellite 
used (Kopemikus). Therefore, it is not described here. More details can be found in 
(Schambeck, 1987). 

The basic building blocks of modem satellite systems are audio coding, some kind of 
multiplexing, and modulation. While there are things in common with DAB with respect to 
the first two aspects, these systems do not use OFDM modulation because of the non-linear 
travelling wave tube amplifiers on satellites and the low spectral efficiency. Instead, often 
relatively simple PSK schemes are used, which are considered sufficient because, when 
using directional receiving antennas, multipath propagation does not occur in satellite 
channels. 

Satellite systems face severe problems when the service is to be extended to mobile 
reception. Because only line-of-sight operation is possible owing to the limited power 
available in satellite transmissions, at higher latitudes, where geostationary satellites have 
low angles of elevation, either terrestrial retransmission in cities and mountainous regions is 
necessary, or the service has to be provided by several satellites at different azimuths in 
parallel to fill shaded areas. Another approach is to use satellites in highly inclined elliptical 
orbits which always show high elevation angles. However, this requires several satellites in 
the same orbit to provide continuous service and difficult switching procedures have to be 
performed at hand-over from the descending to the ascending satellite. 



1. 6.1.1 Astra Digital Radio 

The Astra Digital Radio (ADR) system was designed by Societe Europeenne des Satellites 
(SES), Luxembourg, to provide digital radio on its geostationary direct TV broadcasting 
satellites. This co-positioned family of satellites labelled “ASTRA” covers Central Europe 
operating in the 11 GHz range. Brief system overviews are given by (Hofmeir, 1995) or 
(Kleine, 1995). A detailed description of the system is available at the ASTRA web-site 
(www. ASTRA). 

The system design is compatible with the sub-carrier scheme for analogue sound 
transmission within TV transponders. It uses MPEG Layer II sound coding (as does DAB) 
at a fixed bit rate of 192 kbit/s for a stereo signal. Forward error correction is applied using 
a punctured convolutional code with code rate 3/4. The data are differentially encoded to 
provide easy synchronisation. QPSK modulation is used for each of up to 12 ADR sub- 
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carriers which are combined with the analogue TV signal and are FM modulated. The 
baseband bandwidth of the digital signal is 130 kHz which is the same as for an analogue 
sound sub-carrier. This allows existing analogue radio services to be replaced by the digital 
ADR service one by one. It is also possible to use a whole transponder for ADR only. In 
this case, 48 channels can be accommodated in one transponder. 

Additional data can be sent together with the audio signal. ADR uses the auxiliary data 
field of the MPEG audio frame to do this, but the coding is different from the DAB 
Programme Associated Data (PAD; see Chapter 2). ADR uses 252 bits per audio frame for 
this purpose, which results in a net data rate of about 6 kbit/s, because a (7,4) block code is 
used for error correction. This capacity is flexibly split into “RDS data”, “Ancillary Data” 
and “Control Data”. RDS data are coded in the format of the RDS (EN 50067) which is 
used on a digital sub-carrier on FM sound broadcasts and provides service-related 
information such as labels and Programme Type (see also section 2.5). The reason for using 
this format on ADR is that several broadcasters use ADR to feed their terrestrial FM 
networks. Hence these data can be extracted from the ADR signal and fed to the RDS 
encoder of the FM transmitter. Ancillary data are used by the broadcaster for in-house 
purposes. The control data field contains a list of all ADR channels to provide easy tuning 
and allows transmission of parameters for conditional access if applied. 

With respect to DAB, ADR provides the same quality of audio, because the same 
coding mechanism is used. The number of channels available, however, is much larger. 
While DAB currently only provides six to seven services in each network, ADR can 
provide 12 digital radio channels on each transponder in addition to a TV channel. 
Therefore, several hundred channels are available and in use by public and private 
broadcasters from all over Europe. ADR decoders are integrated in high-end satellite TV 
receivers and are therefore widespread. The most important drawback of ADR is that it can 
only be received with stationary receivers using a dish antenna. The flexibility of the 
system with respect to data transmission and future multimedia extensions is considerably 
lower than that of DAB. 



1.6. 1.2 WorldSpace 

The WorldSpace digital radio system was designed by the US company WorldSpace Inc., 
to provide digital radio to developing countries on its three geostationary satellites named 
“CaribStar” covering Central and South America, “AfriStar” covering Africa, Southern 
Europe and the Near and Middle East, and “AsiaStar” covering India, China, Japan and 
South East Asia. The system downlink operates in the L-band. The standard is not public. A 
description is given by (Sachdev, 1997), for example. 

The basic building blocks of the systems are “Prime Rate Channels” (PRCs) which each 
transmit 16 kbit/s of data. Several of these (typically up to eight, resulting in 128 kbit/s) can 
be combined to provide a stereo sound channel or data channels. For audio coding, MPEG 
Layer III is used. 

Each satellite serves three coverage areas (spots). For each spot, two multiplex signals 
containing 96 PRCs (i.e. 12 stereo channels) each in a time domain multiplex (TDM) are 
available. Each of these data streams is QPSK modulated. One multiplex is completely 
assembled in a ground station and linked up to the satellite from which it is retransmitted 
without further processing. The other multiplex is processed on board the satellites by 
assembling individual PRC uplinks to the TDM signal. The advantage of this is that there 
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are no additional costs to transport the signal from the broadcaster to the uplink station. 
Moreover, a signal can be broadcast in several spots with only one uplink. 

Owing to the lower path loss of the L-band signals as compared to 1 1 GHz, the 
WorldSpace system is able to provide significant field strength within the coverage area. 
Therefore reception is possible with portable receivers outdoors and inside buildings using 
a low-gain patch or helical antenna at a window where line-of-sight conditions apply. 
Minor attenuation by foliage or trees may be tolerable. 

At the time of writing the AfriStar and AsiaStar satellites are operational and several 
receiver manufacturers are beginning to market receivers. 

With respect to DAB, WorldSpace addresses a completely different broadcasting 
environment. Clearly, in developing countries digital satellite radio will considerably 
improve the number and quality of broadcast receptions, because in many parts up to now 
there is only short-wave coverage. Mobile reception is probably not an urgent need and 
hence the system is not particularly designed for it. 

From the point of view of audio quality, by using MPEG Layer III at 128 kbit/s the 
quality will be comparable to that of DAB at 160-192 kbit/s. At this bit rate, however, the 
number of channels per spot is restricted to 24. Owing to the large coverage areas this 
number may not be sufficient to serve the multilingual and multiethnic audience. Hence, the 
audio bit rate will have to be reduced in order to increase the number of channels available. 
Nevertheless the sound quality will in many cases still be comparable at least to mono FM, 
which is still a major improvement on short-wave AM. 

Utilising the concept of PRCs the system is very flexible and can be extended to data 
transmission and multimedia contents. This is important because the lifetime of the 
satellites is 15 years. 



1.6.1.3 Satellite Systems in the United States 

In the United States there is no allocation for digital radio in the L-band, but in the S-band. 
Currently, two companies are in the process of establishing satellite services targeted at 
mobile reception and intended to cover the whole of the continental United States (i.e. 
except Alaska and Hawaii). Sirius Satellite Radio intends to use three satellites on inclined 
orbits to achieve high elevation angles and terrestrial repeaters in metropolitan areas, see 
(www.Siriusradio). The other company, XM Satellite Radio, will use two high-powered 
geostationary satellites at 85° and 115° which both cover the entire continental United 
States and thus provide spatial diversity to overcome shading, see (www.XMRadio). 
Nevertheless terrestrial retransmitters will also be needed. 

At the time of writing, the first satellite of Siriusradio has been launched successfully. 
The systems will be available in 2001 and then provide approximately 100 audio channels 
in a quality comparable to CD. The services will be pay-radio based on a monthly 
subscription fee. Both companies recently agreed to develop a unified standard enabling the 
same receiver to access both systems. 

With respect to sound quality these systems should be comparable to DAB, and with 
respect to reliability of service these systems should be comparable to WorldSpace; 
however, because of the measures taken they should show better performance in mobile 
and portable reception. 
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1 . 6. 2 Terrestrial Digital Broadcasting Systems 

1.6.2. 1 Digital Video Broadcasting - Terrestrial (DVB-T) 

The DVB system was developed a few years after the DAB system. On the audio and video 
coding level as well as on the multiplexing level, it is completely based on the MPEG-2 
standard. This is in contrast to the DAB standard, where the audio coding is MPEG, but the 
multiplex control is independent and specially adapted for the requirements. There are three 
DVB standards that differ very much in the transmission scheme: DVB-S for satellite (EN 
300421), DVB-C for cable (EN 300429), and DVB-T for terrestrial broadcasting (EN 
300744). Here we consider only the last one. For a detailed description of all three 
standards, see (Reimers, 1995). 

The DVB-T standard has been designed for stationary and portable terrestrial reception 
with multipath fading. In contrast to DAB, mobile reception was not required. On the 
coding and modulation level, many methods have been adopted from the DAB system, 
where they had already been implemented successfully. 

Like DAB, DVB-T uses OFDM with a guard interval. There are two transmission 
modes. The first is called 8K mode (because it uses an 8192-point FFT) and has a symbol 
length of the same order as in DAB transmission mode I. It is suited for single frequency 
networks. The second is called 2K mode (because it uses a 2048-point FFT) and has a 
symbol length of the same order as in DAB transmission mode II. It is not suited for single 
frequency networks. The guard interval can be chosen to be 25% of the total symbol length, 
as for DAB, but also shorter guard intervals are possible. The total bandwidth of about 
7.6 MHz is suited for terrestrial 8 MHz TV channels. The system parameters can be scaled 
for 7 MHz TV channels. A major difference from the DAB system is that DVB-T uses 
coherent modulation. Different signal constellations between QPSK and 64-QAM are 
possible. At the receiving side, the channel amplitude and phase have to be estimated. For 
channel estimation, nearly 10% of the total bandwidth is needed for pilot symbols. The 
coherent modulation scheme with channel estimation by a Wiener filter is more advanced 
than the differential demodulation scheme of DAB. It is even more robust against fast 
fading in mobile reception situations (Hoeher, 1991a), (Schulze, 1998), (Schulze, 1999). 

The OFDM transmission scheme for DVB-T includes a frequency interleaver that 
consists of a simple pseudo-random permutation of the sub-carrier indices. It is very similar 
to the DAB frequency interleaver. In contrast to DAB, the DVB-T system has no time 
interleaving. Time interleaving only makes sense for mobile reception, which was not a 
requirement for the design of the system. 

The channel coding is based on the same convolutional codes like the DAB system. 
Code rates between 1/2 and 7/8 are possible. Unequal error protection (UEP) is not possible 
for the DVB system, since the data and transmission level are completely separated. As a 
further consequence, in contrast to DAB, the whole multiplex has to be coded with the 
same code rate. To reach the very low bit error rates that are required for the video codec, 
an outer Reed-Solomon (RS) code is applied: blocks of 188 bytes of useful data are coded 
into blocks of 204 bytes. Between the outer RS code and the inner convolutional code, a 
(relatively) short byte interleaver has been inserted to break up the burst errors that are 
produced by the Viterbi decoder. 
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The data rates that can be carried by the DVB-T multiplex vary from about 5 Mbit/s to 
about 30 Mbit/s, depending on the modulation (QPSK, 16-QAM or 64-QAM), the code rate 
(1/2, 2/3, 3/4, 5/6 or 7/8) and the guard interval. 

Mobile reception of DVB-T, even though not a specific design feature, has been widely 
discussed and gave rise to many theoretical investigations and practical experiments. 
Furthermore, there seems to be a controversy between DAB and DVB adherents that 
concentrates very much on this question. Unfortunately, even though both systems are very 
close together, the chance has been missed to design a universal common system. From the 
technical point of view, we will list some comments on the question of mobile reception: 

1 . Unlike one might expect, the differential modulation of DAB is not more robust than 
the coherent modulation for DVB even for severely fast-fading mobile reception. The 
coherent scheme is more robust and spectrally more efficient, see (Schulze, 1999). 

2. For mobile reception in a frequency flat fading situation, the frequency interleaving 
even over a relatively wide band of 8 MHz is not sufficient. As a consequence, the lack 
of time interleaving may severely degrade the system in some mobile reception 
situations, see (Schulze, 1998). 

3. Synchronisation losses may sometimes occur for mobile receivers. In contrast to DVB- 
T, this was taken into account in the design of the DAB system: the null symbol allows 
a rough frame and symbol synchronisation every 24 ms or a maximum of 96 ms. The 
FIC can then immediately be decoded and provides the information about the multiplex 
structure. And, in the (typical) stream mode, the data stream does not need separate 
synchronisation because it is synchronised to the physical frame structure. 

4. Bit errors in the scale factors of the MPEG audio data ( birdies ) are very annoying for 
the human ear. For a subjective impression this is much worse than ,say, block error in 
a picture. The DAB system uses an extra CRC (that is not part of the MPEG bit- 
stream) to detect and to conceal these errors. This is not possible for DVB-T. 

As a consequence, the DVB-T system allows mobile reception in many typical 
environments, but it may fail in some situations where more care has been taken in the 
DAB system. 

1.6. 2. 2 In-Band-On-Channel Systems 

In the United States the broadcasting environment is considerably different from the 
European one. Technical parameters of the stations (AM or FM, power, antenna height) 
resulting in different coverage areas and quality of audio service appear to be important 
economic factors that are vital to the broadcasters and which many of them think have to be 
preserved in the transition to a digital system. Although the DAB system can provide such 
features to a certain extent by applying different audio data rates and coding profiles, the 
idea of multiplexing several stations into one transmitting channel does not seem to be very 
attractive to some US broadcasters. 

In the early 1990s, a number of US companies therefore started to work on alternative 
approaches to digital audio radio (DAR) broadcasting known as IBOC (In-Band-On- 
Channel) and IBAC (In-Band- Adjacent-Channel). Systems were developed both for AM 
and FM stations in MW and VHF Band II. The basic principle is to use low-power digital 
sideband signals within the spectrum mask of the channel allocated to the station. 
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Later in the 1990s a number of IBOC/IBAC systems and the Eureka 147 DAB system 
and a satellite-based system (VOA/JPL) were tested in both the laboratory and field in the 
San Francisco area. The results of these tests were not encouraging for the IBOC/IBAC 
approaches, see (Culver, 1996), stating explicitly that “of all systems tested, only the 
Eureka 147/DAB system offers the audio quality and signal robustness performance that 
listeners would expect from a new DAR service”. Since then work on IBOC systems has 
nevertheless continued. At the time of writing (mid 2000), the US National Radio Systems 
Committee (NRSC) is starting a new testing procedure, to which the companies USA 
Digital Radio and Lucent will contribute IBOC system proposals called iDAB and LDR. 

However, independent of the individual system details there are two principle problems 
with the FM IBOC approach which are difficult to overcome. 

Owing to the fact that the IBOC signal is limited in bandwidth to the FCC channel 
spectrum mask, the bandwidth is not sufficient to overcome frequency-selective fading. 
This means that in portable or mobile reception a fade of the whole signal can occur. In this 
case there is hardly a chance to reconstruct the signal even if strong error correction codes 
and “long” interleaving are used (think of stopping at a traffic light). Therefore, from the 
point of view of coverage, IBOC systems are not expected to be superior to FM, because 
they will fail in the same locations as FM does. 

Every IBOC signal will to a certain extent impair the analogue signal in the same 
channel. The effect of this is additional noise on the analogue audio signal and this may 
affect different analogue receiver circuits in different ways. Therefore the broadcaster has 
no control of what the digital signal will do to all the different analogue receivers being 
used. To keep this noise sufficiently low, the digital signal level must be kept far below the 
level of the analogue signal. Although the digital system requires less signal-to-noise ratio 
than FM, this means that the coverage of IBOC will be very limited. 

1.6.2.3 Integrated Services Digital Broadcasting (Japan) 

In Japan, the NHK Science and Technical Research Laboratories have proposed a concept 
called Integrated Services Digital Broadcasting (ISDB). From this approach a system was 
created which can be configured for terrestrial and satellite broadcasts of radio, TV and 
multimedia services, see (Kuroda, 1997), (Nakahara, 1996). For the terrestrial system 
(ISDB-T) the most important aims are rugged reception with portable and mobile receivers, 
use of Single Frequency Networks (SFNs) to achieve frequency efficient coverage and to 
have a flexible scheme in order to be future-proof. To achieve this, a variant of OFDM (see 
section 2.2) was developed which is called Band Segmented Transmission (BST)-OFDM 
and means that each signal consists of a number of basic OFDM building blocks called 
segments with a bandwidth of 432 kHz which are spread over the band where frequencies 
are available. The OFDM parameters of the systems are similar to transmission mode IV of 
the DAB system because the system is designed for the UHF range. 

A single 432 kHz segment is sufficient to broadcast audio programmes and data, but it 
is also possible to combine three segments for this purpose. For TV signals 13 segments are 
combined to form a 5.6 MHz wide signal. In all cases the basic modulation and coding 
modes of the segments are (D)QPSK, 16QAM and 64QAM, and several codes rates 
between 1/2 and 7/8 are available. In the 5.6 MHz mode the individual segments may be 
coded and modulated in different ways to achieve hierarchical transmission capacity. For 
the audio and video coding the MPEG-2 standard will be used. 
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In 1998 draft standards on digital terrestrial television and radio based on the ISDB-T 
concept were established, see (STRL, 1999), and field tests are being performed. 

From the point of view of relation to DAB, ISDB-T basically uses the same principles 
and hence provides similar features as do DAB and DVB-T, respectively. By to using the 
MPEG-2 standard for coding, the audio and video quality will be comparable. Owing to the 
similarity of the OFDM parameters used, and similar bandwidth (at least when three 
segments are combined for audio), the coverage properties will also be similar to those of 
DAB. The advantage of ISDB-T is that the radio and TV systems are based on the same 
basic building blocks (although the coding and modulation will be different), which allows 
for reuse of some circuits in receivers, whereas DAB and DVB-T differ in many details. 
Another advantage of ISDB-T is the availability of a narrow-band mode (i.e. a single 
segment) which can deliver one or two radio programmes to small regions or communities, 
although potentially with some reception problems due to flat fading. With this 
segmentation ISDB-T can also flexibly use the frequencies not occupied by analogue 
services in the introductory simulcast phase. Such possibilities are not present in DAB. 



1.6.3 Digital Radio in the Broadcasting Bands below 30 MHz 

The frequency range below 30 MHz (short, medium, long wave) offers the possibility to 
provide a very large coverage area for broadcasting with only one high-power transmitter 
and very simple receivers with no directional antennas. Using this frequency range is the 
easiest - and often practically the only possible - way to cover large countries with low 
technical infrastructure. On the other hand, traditional broadcasting in this frequency range 
uses the most antiquated and simplest possible transmission scheme: double-sideband AM 
with a carrier. From one point of view this is a waste of power and bandwidth efficiency in 
such a valuable frequency range at a time when every year more and more efficient 
communication systems are being designed. From a practical point of view, this old 
technique provides only very poor audio quality and thus finds acceptance only in regions 
where no other service is available. 

The obvious need for a more advanced system working in this frequency range gave rise 
to a consortium called Digital Radio Mondiale (DRM) to develop a world-wide standard 
for such a system. At the time of writing (mid 2000) the work is progressing to complete 
the specification and to go on with field tests. Although there are still some parameters to 
be defined, the basic structure of the system seems to be fixed now. 

The available bandwidth in this frequency range is typically very small: the usual 
channel spacing is 5, 9 or 10 kHz. For the new system, it should be possible to bundle two 
5 kHz channels to one 10 kHz channel. For such narrow-band channels, much more data 
reduction is required than for the DAB system. The Advanced Audio Coding (AAC) data 
reduction scheme of MPEG allows an audio quality comparable to FM at a data rate 
between 20 and 25 kbit/s, see for example (Dietz, 2000). This would be a great 
improvement compared to conventional AM quality. But bearing in mind that the DAB 
system typically transmits little more than 1 Mbit/s (e.g. six audio programmes of 
192 kbit/s) in a 1.5 MHz bandwidth, even this low data rate is relatively high compared to 
the extremely small bandwidth of 10 kHz. This means that the DRM transmission scheme 
must provide more than twice the bandwidth efficiency compared to DAB. 
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The physical channel depends on the type of wave propagation, which is very special in 
this frequency range. It may differ from night to day and it may depend on the solar 
activity. Also interference caused by human noise plays an important role. In situations 
where only ground-wave propagation is present, there is a simple white Gaussian noise 
channel that is relatively easy to handle. But typically the wave propagation is dominated 
by ionospheric scattering (sky wave). This means that the channel is time and frequency 
variant - just like the mobile DAB channel (see section 2.1). Travel time differences of the 
wave of the order of 5 ms may occur which cause severe frequency-selective fading. The 
motion of the ionosphere causes a time variance in the channel with a Doppler spectrum 
with a typical bandwidth of the order of 1 Hz. 

The DRM channel has similar problems in time- and frequency-selective fading as the 
DAB channel, so the DRM group decided on the same solution: OFDM (see section 2.1). 
The system parameters are not finally fixed at the time of writing. A possible choice that 
may be suited for sky wave propagation could be an OFDM symbol duration of 27 ms and 
a guard interval of 5.3 ms. With this symbol length, the OFDM symbol would consist of 
approximately 200 sub-carriers. The requirement of a very high-bandwidth efficiency leads 
to a much higher signal constellation for each sub-carrier than used for DAB. It turned out 
that 64-QAM is a good choice. Because of the very large time variance of the channel, a 
large amount of the total bandwidth (about 1/6) will be needed for channel estimation 
pilots. It turns out that an overall code rate between 1/2 and 2/3 will be needed for the 
required data rate. It can be shown (Schulze, 1999) that such a coded 64-QAM OFDM 
system with a good channel estimator is surprisingly robust in a fading channel even with 
conventional 64-QAM symbol mapping. For DRM it has been decided to use the multilevel 
coding approach first proposed by (Imai, 1977). This approach is very similar to that 
described by (Worz, 1993) for a coded 8-PSK system. With multilevel 64-QAM, an 
additional gain of up to 1 or 2 dB can be reached compared to conventional 64-QAM. DRM 
is the first system that implements such a coded modulation scheme, see also (Dietz, 2000). 
The same convolutional codes as DAB will be used here. 

For ground mode propagation, another transmission mode may be defined, because less 
channel estimation is needed, the guard interval may be shorter, and even weaker codes can 
be used in a Gaussian channel. 

Channel coding must be supported by interleaving to work in a fading channel. 
Frequency interleaving alone is not sufficient, especially if the echoes are too short or if 
only a two-path propagation situation occurs. Time interleaving is restricted by the delay 
that is allowed by the application: after the receiver has been switched on the listener 
cannot wait too long for a signal. A delay of 2 seconds is regarded as an upper limit. Since 
the time variance is very slow (1 Hz Doppler spread), the interleaving will not be sufficient 
in many situations. 



1. 6.4 Web-casting 

A completely different way to provide radio services is the use of the Internet as a universal 
multimedia platform. 

The basic equipment for an Internet radio customer is a PC, complete with a sound 
adapter, a rather fast modem and Internet access, possibly via ISDN or ASDL. 
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The availability of improved and scalable audio coding schemes, on the other hand, is 
creating the opportunity for a better definition of audio quality obtainable via the Web. The 
Internet’s impact on broadcasters has changed from its initial position as a site for posting 
information related to the station and programme to an additional medium to reach new 
customers through streaming audio, live or on-demand, still and moving pictures, and new 
interactive concepts. The importance of Internet audio services, which are using only a 
small part of the capacity of the net, is already significantly increasing audio information 
delivered via IP. Several software companies have produced the necessary tools to 
distribute audio and video via the Internet with the goal to develop the Web as a new mass 
medium, similar to the traditional audio and video broadcasting services. 

With the current streaming techniques and 56 kbit/s modems an audio quality can be 
achieved surpassing by far that of MW broadcasting with audio bit rates which are reduced 
by a factor of 50, compared to those used with the CD. 

For both, either for live streaming or downloading of audio clips, there are several audio 
players (software codecs) in use which can provide sufficient audio quality in the range 
mentioned below, such as 

• Microsoft Windows Media 4 

• RealNetworks G2 

• Q-Design Music Codec 2 together with Quicktime streaming 

• MPEG-2/-4 AAC “Advanced Audio Coding” 

• MP2, which is MPEG-1/-2 Layer II 

• MP3 (close to MPEG-1/-2 Layer III) 

• Yamaha SoundVQ. 

With this variety of audio players already on the market (mostly for free!) it may be 
difficult to reach international standardisation in this field. Nevertheless, web-casting will 
have its own place in the radio world because of the access to hundreds of radio 
programmes world-wide. 

Much more detailed information on web-casting and its audio quality expectations is 
contained in (BPN 022) and (Stoll, 2000). 
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2.1 The Physical Channel 

Mobile reception without disturbance was the basic requirement for the development of the 
DAB system. 

The special problems of mobile reception are caused by multipath propagation: the 
electromagnetic wave will be scattered, diffracted, reflected and reaches the antenna in 
various ways as an incoherent superposition of many signals with different travel times. 
This leads to an interference pattern that depends on the frequency and the location or - for 
a mobile receiver - the time. 

The mobile receiver moves through an interference pattern that changes within 
microseconds and that varies over the transmission bandwidth. One says that the mobile 
radio channel is characterised by time variance and frequency selectivity. 

The time variance is determined by the vehicle speed v and the wavelength X ~ df$ , 
where / 0 is the transmission frequency and c the velocity of light. The relevant physical 
quantity is the maximum Doppler frequency shift: 



fo 




1 fo V 
1080 MHz km/h 



( 2 . 1 ) 



Table 2.1 shows some practically relevant figures for //) max . 
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Table 2.1 Examples for Doppler frequencies 



/Dmax 


v=48 km/h 


v-96 km/h 


v=192 km/h 


/ 0 = 225 MHz 


10 Hz 


20 Hz 


40 Hz 


/ 0 = 900 MHz 


40 Hz 


80 Hz 


160 Hz 


/q=1500 MHz 


67 Hz 


133 Hz 


267 Hz 



The actual Doppler shift of a wave with angle a relative to the vector of the speed of the 
vehicle is given by 



fo = /dmx cos(a). (2.2) 

Typically, the received signal is a superposition of many scattered and reflected signals 
from different directions so that we may speak not of a Doppler shift but of a Doppler 
spectrum. Figure 2.1 shows an example of a received VHF signal level for a fast-moving 
car (v = 192 km/h) as a function of the time for a carrier wave of fixed frequency 
f 0 = 225 MHz. 




Figure 2.1 Time variance of multipath fading 
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Figure 2.2 Time variance as a curve in the complex plane 

The superposition of Doppler-shifted carrier waves leads to a fluctuation of the carrier 
amplitude and the phase. This means the received signal has been amplitude and phase 
modulated by the channel. Figure 2.2 shows the trace of the phasor in the complex plane. 
For digital phase modulation, these rapid phase fluctuations cause severe problems if the 
carrier phase changes too much during the time T s that is needed to transmit one digitally 
modulated symbol. Amplitude and phase fluctuate randomly. The typical frequency of the 
variation is of the order of/^ max . Consequently, digital transmission with symbol time T s is 
only possible if 



fomax^S << 1 ‘ ( 2 . 3 ) 

The frequency selectivity of the channel is determined by the different travel times of the 
signals. They can be calculated as the ratio between the travelling distances and the velocity 
of light. Table 2.2 shows some typical figures. 



Table 2.2 Examples for travel times of the signal 



Distance 


300 m 


3 km 


30 km 


Time 


1 JUS 


10 ps 


100 ps 
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Travel time differences of some microseconds are typical for cellular mobile radio. For 
a broadcasting system for a large area, echoes up to 100 ps are possible in a hilly or 
mountainous region. In so-called single frequency networks (see chapter 7) the system must 
cope with even longer echoes. Longer echoes correspond to more fades inside the 
transmission bandwidth. Figure 2.3 shows an example of a received signal level as a 
function of the frequency at a fixed location where the travel time differences of the signals 
correspond to several kilometres. In the time domain, intersymbol interference disturbs the 
transmission if the travel time differences are not much smaller than the symbol duration 
7\ A data rate of 200 kbit/s, for example, leads to T s = 10 ps for the QPSK (Quaternary 
Phase-Shift Keying) modulation. This is of the same order as the echoes. This means that 
digital transmission of that data rate is not possible without using more sophisticated 
methods. Known techniques are equalisers, spread spectrum and multicarrier modulation. 
Equalisers are used in the GSM standard. The data rate for DAB is much higher than for 
GSM and the echoes in a broadcasting scenario are much longer than in a cellular network. 
This would lead to a higher complexity for the equaliser. Spread spectrum is spectrally 
efficient only for cellular networks where it is used as multiple access (CDMA), as in the 
UMTS standard. For DAB it was therefore decided to use multicarrier modulation, because 
it is able to cope with very long echoes and it is easy to implement. 



Level 

[dB] 




Figure 2.3 Frequency selectivity of multipath fading 

For a more detailed treatment of the mobile radio channel and transmissions techniques, 
we refer to textbooks like (Proakis, 1995), (Kammeyer, 1996) and (David, 1996). 
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2.2 The DAB Transmission System 

2.2 . 1 Multicarrier Modulation 

To cope with the problem of intersymbol interference caused by long echoes, DAB uses a 
special type of multicarrier modulation: OFDM (Orthogonal Frequency Division 
Multiplex), The simple idea behind multicarrier modulation is to split up the high-rate data 
stream into K parallel data streams of low data rate and to modulate each of them separately 
on its own (sub-)carrier. This leads to an increase of the symbol duration T s by a factor of 
K. For sufficiently high K , it is possible to keep T s significantly longer than the echo 
duration and to make the system less sensitive to intersymbol interference. 

OFDM is a spectrally very efficient kind of multicarrier modulation, because it 
minimises the frequency separation between the individual carriers by allowing some 
controlled spectral overlap between the carriers, without causing adjacent channel 
interference (ACI). This goes back to the mathematical property of orthogonality that gave 
the name to OFDM. 

It is easy to understand an OFDM signal s(t) as a kind of signal synthesis by a finite 
Fourier series defined by 



s(t)= Y,z k -e J2MIT . (2.4) 

k——K / 2 

It is defined on an interval (Fourier period) of length T. The complex Fourier coefficients z k 
carry the digitally coded information. For each time interval of length T, another set of K + 1 
information-carrying coefficients can be transmitted. In practice, the DC coefficient for 
k = 0 will not be used (i.e. is set to zero) for reasons of hardware implementation. The 
Fourier synthesis can be interpreted as a modulation of each complex modulation symbol z k 
on a complex carrier wave exp (j2nktlT) with frequency k/T (k-± 1, ±2,..., ±K/2). The 
signal s(t) is the complex baseband signal and has to be converted to an RF signal by means 
of a quadrature modulator. At the receiver side, Fourier analysis of the downconverted 
complex baseband signal will produce the complex symbols using the well-known formula 

z k =-)e- JUk,IT s(t)dt, ( 2 - 5 ) 

T 0 

which results from the orthogonality of the carrier waves. Both Fourier analysis and 
synthesis will be implemented digitally by the FFT (Fast Fourier Transform) and IFFT 
(Inverse FFT) algorithms. The transmission chain is shown in Figure 2.4. 

The part of the OFDM signal that transmits the K complex coefficients z k is called the 
OFDM symbol. 

To make the transmission more robust against long echoes, the OFDM symbol period T s 
will be made longer than the Fourier period T by a so-called cyclic prefix or guard interval 
of length A simply by cyclic continuation of the signal. A synchronisation error smaller 
than A will then only lead to a frequency-dependent but constant phase shift. Echoes are 
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Physical 
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Figure 2.4 FFT implementation of OFDM 

superpositions of ill-synchronised signals and will cause no intersymbol interference, but a 
constant phasor, as long as the delays are smaller than A. For DAB, differential quatrature 
phase shift keying (DQPSK) is used so that this constant phase cancels out at the 
demodulator. 

The length of T s is limited by the requirement that the phase fluctuations must be small, 
that is 

fomaJ's <<: 1 • ( 2 - 6 ) 

On the other hand, long echoes require a long guard interval and a long T s . To keep the 
system flexible for different physical situations, four Transmission Modes (TMs) with 
different parameter sets have been defined, see Table 2.3. 

Table 2.3 The OFDM parameters for the four DAB transmission modes 



Mode 


K 


1 IT 


Ts 


A 


Max. frequency 


TMI 


1536 


1 kHz 


=1246 ps 


=246 ps 


=375 MHz 


TM II 


384 


4 kHz 


=312 ps 


=62 ps 


=1.5 GHz 


TM III 


192 


8 kHz 


=156 ps 


=31 ps 


=3 GHz 


TM IV 


768 


2 kHz 


=623 ps 


=123 ps 


=750 MHz 



The product of the number of sub-carriers K and the spacing 1 IT between them is the 
same for all transmission modes and determines the total signal bandwidth of approx. 1.5 
MHz. The parameters of all transmission modes can be easily scaled into each other. The 
ratio A IT is always the same. The last column in the table gives a rule of thumb for the 
maximum transmission frequency due to the phase fluctuation caused by the Doppler 
effect. A car speed of 120 km/h and a physical channel with no line of sight (the so-called 
isotropic Rayleigh channel, see (David, 1996)) has been assumed. 

Transmission mode I with the very long guard interval of nearly 250 ps has been 
designed for large-area coverage, where long echoes are possible. It is suited for single 
frequency networks with long artificial echoes; 200 ps correspond to a distance of 60 km, 
which is a typical distance between transmitters. If all transmitters of the same coverage 
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area are exactly synchronised and send exactly the same OFDM signal, no signal of 
relevant level and delay longer than the guard interval will be received. Since the OFDM 
symbol length Ts is very long, transmission mode I is sensitive against rapid phase 
fluctuations and should only be used in the VHF region. 

Transmission mode II can cope with echoes that are typical of most topographical 
situations. However, in mountainous regions, problems may occur. This mode is suited for 
the transmission in the L-band at 1 .5 GHz. 

Transmission mode III has been designed for satellite transmission. It may be suited 
also for terrestrial coverage, if no long echoes are expected. 

The parameters of TM IV lie just between mode I and II. It was included later in the 
specification to take into account the special conditions of the broadcasting situation in 
Canada. It will be used there even at 1.5 GHz. This is possible because of limited speed and 
direct line of sight. 



2.2.2 The Frame Structure of DAB 

For each transmission mode, a transmission frame is defined on the physical signal level as 
a periodically repeating structure of OFDM symbols that fulfil certain tasks for the data 
stream. It is an important feature of the DAB system (and in contrast to the DVB system) 
that the time periods on the physical level and on the logical (data) level are matched. The 
period T F of the transmission frame is either the same as the audio frame length of 24 ms or 
an integer multiple of it. As a consequence, the audio data stream does not need its own 
synchronisation. This ensures a better synchronisation stability especially for mobile 
reception. 

The structure for TM II is the simplest and will thus be described first. The frame length 
is 24 ms. The first two OFDM symbols of the transmission frame build up the 
Synchronisation Channel (SC). The next three OFDM symbols carry the data of the Fast 
Information Channel (FIC) that contains information about the multiplex structure and 
transmitted programmes. The next 72 OFDM symbols carry the data of the Main Service 
Channel (MSC). The MSC carries useful information, such as audio data or other services. 
Figure 2.5 shows the transmission frame structure. It is also valid for TMs I and IV. 







96 ms (TM 1) or 24 ms (TM 11) or 48 ms (TM IV) 



Figure 2.5 Transmission frame structure 



All these OFDM symbols in a transmission frame of TM II have the same duration T s « 
312 ps, except for the first one. This so-called null symbol of length T Nun « 324 ps is to be 
used for rough time synchronisation. The signal is set to zero (or nearly to zero) during this 
time to indicate on the physical level the beginning of a frame. The second OFDM symbol 
of the SC is called the TFPR (Time-Frequency-Phase Reference) symbol. The complex 
Fourier coefficients z k have been chosen in a sophisticated way so that this symbol serves as 
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a frequency reference as well as for channel estimation for the fine tuning of the time 
synchronisation. Furthermore, it is the start phase for the differential phase modulation. 
Each of the following OFDM symbols carries 384 DQPSK symbols corresponding to 768 
bits (including redundancy for error protection, see below). The three OFDM symbols of 
the FIC carry 2304 bits. Because they are highly protected with a rate 1/3 code, only 768 
data bits remain. The FIC data of each transmission frame can be decoded immediately 
without reference to the data of other transmission frames, because this most important 
information must not be delayed. The 72 OFDM symbols of the MSC carry 55296 bits, 
including error protection. This corresponds to a (gross) data rate of 2.304 Mbit/s. The data 
capacity of 55296 bits in each 24 ms time period is organised in so-called Capacity Units 
(CUs) of 64 bits. In the MSC many audio programmes and other useful data services are 
multiplexed together. Since each of them has its own error protection, it is not possible to 
define a fixed net data rate of the DAB system. 

The transmission frames of TMs I and IV have exactly the same structure. Since the 
OFDM symbols are longer by a factor of 2 or 4, respectively, the transmission frame 
length is 48 ms or 96 ms. The number of bits in the FIC and MSC increases by the same 
factor, but the data rate is always the same. 

For TM III, the frame duration is T F = 24 ms. Eight OFDM symbols carry the FIC, and 
144 OFDM symbols carry the MSC. The data rate of the FIC is higher by a factor of 4/3 
compared to the other modes. The MSC always has same data rate. 

For all four transmission modes, the MSC transports 864 CUs in 24 ms. There is a data 
frame of 864 CUs - 55296 bits common for all transmission modes that is called the 
Common Interleaved Frame (CIF). For TMs II and III, there is exactly one CIF inside the 
transmission frame. For TM I, there are four CIFs inside one transmission frame of 96 ms. 
Each of them occupies 18 subsequent OFDM symbols of the MSC. The first is located in 
the first 18 symbols, and so on. For TM IV, there are two CIFs inside one transmission 
frame of 48 ms. Each of them occupies 36 subsequent OFDM symbols of the MSC. 

2.2.3 Channel Coding 

The DAB system allows great flexibility in the choice of the proper error protection for 
different applications and for different physical transmission channels. Using rate 
compatible punctured convolutional (RCPC) codes introduced by (Hagenauer, 1988), it is 
possible to use codes of different redundancy without the necessity for different decoders. 
One has a family of RCPC codes originated by a convolutional code of low rate that is 
called the mother code. The daughter codes will be generated by omitting specific 
redundancy bits. This procedure is called puncturing. The receiver must know which bits 
have been punctured. Only one Viterbi decoder for the mother code is necessary. The 
mother code used in the DAB system is defined by the generators (133,171,145,133) in 
octal notation. The encoder is shown as a shift-register diagram in Figure 2.6. 

The mother code has the code rate R c = 1/4, that is for each data bit a { the encoder 
produces four coded bits x 0iZ , X\ th x 2 ,i and x 3(/ . As an example, the encoder output 
corresponding to the first eight data bits may be given by four parallel bit streams written in 
the following matrix (first bit on the left hand side): 





System Concept 



31 



1 

1 

1 

1 



0 110 1 
1110 0 
10 0 10 
0 110 1 



1 

1 

1 

1 



0 

0 

0 

0 



A code of rate 1/3 or 1/2, respectively, can be obtained by omitting the last one or two rows 
of the matrix A code of rate 2/3 (= 8/16) can be obtained by omitting the last two columns 
and every second bit in the second column If we shade every omitted (punctured) bit, we 
get the matrix 



0 1 
1 1 
1 0 
0 1 



1 

1 

0 

1 



0 1 
0 0 
1 0 
0 1 



0 

0 

0 

0 



For 8 data bits now only 12 encoded bits will be transmitted' the code has rate 8/12. Using 
this method, one can generate code rates 8/9, 8/10, 8/11, .8/31, 8/32 The puncturing 
pattern can even be changed during the data stream, if the condition of rate compatibility is 
taken into account (Hagenauer, 1 988) 




x 



1 i 



X, , 



Figure 2.6 Encoder for the DAB mother code 

RCPC codes offer the possibility of Unequal Error Protection (UEP) of a data stream 
some bits in the data stream may require a very low bit error rate (BER), others may be less 
sensitive against errors Using RCPC codes, it is possible to save capacity and add just as 
much redundancy as necessary 

UEP is especially useful for audio data They are organised in frames of 24 ms The first 
bits are the header, the bit allocation (BAL) table, and the scale factor select information 
(SCFSI) An error in this group would make the whole frame useless Thus it is necessary 
to use a strong (low-rate) code here The next group consists (mainly) of scale factors 
Errors will cause annoying sounds ("birdies"), but these can be concealed up to a certain 
amount on the audio level The third group is the least sensitive one It consists of sub-band 
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samples. A last group consists of Programme-associated Data (PAD) and the cyclic 
redundancy check (CRC) for error detection in the scale factor (of the following frame). 
This group requires approximately the same protection as the second one. The distribution 
of the redundancy over the audio frame defines a protection profile. An example is shown 
in Figure 2.7. 
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Figure 2.7 Example of an audio UEP profile 

The PAD may be extended to the so-called X-PAD. In this case, the PAD group size 
increases and the audio sub-band sample group decreases. It is important to note that the 
error protection does not take this into account. The X-PAD is thus worse protected (see 
section 2.3. 3.2). 

For audio data with a sampling frequency of 48 kHz, the DAB system allows 14 
different data rates between 32 and 384 kbit/s. The protection profiles for all these date 
rates are grouped into five Protection Levels PL1 to PL5. Inside each protection level 
different data rates are possible, but the robustness against errors is the same. This means, 
for example, that if a broadcaster switches between 192 and 256 kbit/s, the audio quality 
will change, but not the coverage area. PL1 is the most robust protection level, PL5 the 
least robust one. All protection levels except PL5 are designed for mobile reception; 14 data 
rates and five protection levels lead to 70 possible combinations. For 64 of them, a 
protection profile is defined. Table 2.4 shows the possible combinations and the required 
number of capacity units. 

The DAB system allows eight protection levels for Equal Error Protection (EEP). They 
are intended for data transmission. For the co-called A-profiles 1-A, 2 -A, 3-A, 4-A, all data 
rates are possible that are integer multiples of 8 kbit/s. For the B-profiles the data rate must 
be a multiple of 32 kbit/s. Table 2.6 shows the eight protection levels and their code rates. 
The third column shows the number of CUs required for a 64 kbit/s data stream. The fourth 
column shows the required SNR to reach a BER of 2-KT 4 for TM II in a Rayleigh fading 
channel with/}) max = 40 Hz. The fifth column shows the same for f Dmax = 125 Hz. 
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Table 2.4 Capacity needed for the possible combinations of audio data rates and protection levels 



Data Rate 


PL1 


PL2 


PL3 


PL3 


PL5 


32 kbit/s 


35 CUs 


29 CUs 


24 CUs 


21 CUs 


16 CUs 


48 kbit/s 


52 CUs 


42 CUs 


35 CUs 


29 CUs 


24 CUs 


56 kbit/s 


X 


52 CUs 


42 CUs 


35 CUs 


29 CUs 


64 kbit/s 


70 CUs 


58 CUs 


48 CUs 


42 CUs 


32 CUs 


80 kbit/s 


84 CUs 


70 CUs 


58 CUs 


52 CUs 


40 CUs 


96 kbit/s 


104 CUs 


84 CUs 


70 CUs 


58 CUs 


48 CUs 


1 12 kbit/s 


X 


104 CUs 


84 CUs 


70 CUs 


58 CUs 


128 kbit/s 


| 140 CUs 


1 16 CUs 


96 CUs 


84 CUs 


64 CUs 


1 60 kbit/s 


168 CUs 


140 CUs 


1 16 CUs 


104 CUs 


80 CUs 


192 kbit/s 


208 CUs 


168 CUs 


: 140 cus 


| 1 16 CUs 


96 CUs 


224 kbit/s 


232 CUs 


208 CUs 


168 CUs 


n 140 CUs 


1 16 CUs 


256 kbit/s 


280 CUs 


232 Cus 


192 CUs 


168 CUs 


128 CUs 


320 kbit/s 


X 


280 CUs 


X 


208 CUs 


160 CUs 


384 kbit/s 


416 Cus 


X 


280 CUs 


X 


192 CUs 



Note It can be seen from the figures in Table 2 4 that the coding strategy supports many possible changes of 
configuration For example, if a 256 kbit/s audio channel is split up into two 128 kbit/s channels at the same 
protection level, they will require the same capacity Furthermore, in most cases one can increase the protection 
to the next better level and lower the audio data rate by one step without changing the required capacity Such a 
diagonal of constant capacity of 140 CUs has been marked by shading in Table 2 4 It is possible to multiplex 
several audio channels of different size together, as long as their total size does not exceed 864 CUs Table 2 5 
shows as an example the number of 192 kbit/s audio programmes that can be transmitted for the different 
protection levels and the signal-to-noise ratio (SNR) that is needed at the receiver in a typical (not fast) fading 
channel (Schulze, 1995) A small capacity for data services is always left 

Table 2.5 Number of 192 kbit/s audio programmes and required SNR 



Protection Level 


Number of Programmes 


SNR 


PL1 


4 


7 4 dB 


PL2 


5 


9 0 dB 


PL3 


6 


11 OdB 


PL4 


7 


12 7 dB 


PL5 


8 


16 5 dB 



The protection levels 4-A and 4-B are very sensitive to fast fading They should not be 
used for mobile applications 

All the channel coding is based on a frame structure of 24 ms These frames are called 
logical frames They are synchronised with the transmission frames, and (for audio) with 
the audio frames At the beginning of one logical frame the coding starts with the shift 
registers in the all-zero state At the end, the shift register will be forced back to the all-zero 
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state by appending 6 additional bits (so-called tail bits) to the useful data to help the Viterbi 
decoder. After encoding such a 24 ms logical frame builds up a punctured codeword. It 
always contains an integer multiple of 64 bits, that is an integer number of CUs. Whenever 
necessary, some additional puncturing is done to achieve this. A data stream of subsequent 
logical frames that is coded independently of other data streams is called a sub-channel. 
For example, an audio data stream of 192 kbit/s is such a possible sub-channel. A PAD 
data stream is always only a part of a sub-channel. After the channel encoder, each sub- 
channel will be time interleaved independently as described in the next subsection. After 
time interleaving, all sub-channels are multiplexed together into the common interleaved 
frame (CIF). 



Table 2.6 EEP levels: code rate, 64 kbit/s channel size, and required SNR 



Protection Level 


Rc= 


Size of 64 kbit/s 


SNR (40 Hz) 


SNR (125 Hz) 


1-A 


1/4 


96 CUs 


5.0 dB 


5.4 dB 


2-A 


3/8 


64 CUs 


7.1 dB 


7.6 dB 


1-B 


4/9 


54 CUs 


8.4 dB 


8.8 dB 


3-A 


1/2 


48 CUs 


9.3 dB 


10.0 dB 


2-B 


4/7 


42 CUs 


10.6 dB 


11.5 dB 


3-B 


4/6 


36 CUs 


12.3 dB 


13.9 dB 


4-A 


3/4 


32 CUs 


15.6 dB 


19.0 dB 


4-B 


4/5 


30 CUs 


16.2 dB 


21.5 dB 



Convolutional codes and their decoding by the Viterbi algorithm are treated in 
textbooks about coding, see for example (Clark, 1988), (Bossert, 1998), (Proakis, 1995). 
The paper of (Hoeher, 1991b) gives some insight how the channel coding for DAB audio 
has been developed. It reflects the state of the research work on this topic a few months 
before the parameters were fixed. Bit error curves of the final DAB coding scheme in a 
mobile fading channel and a discussion of the limits of the system can be found in the paper 
of (Schulze, 1995). 



2. 2 . 4 Interleaving and PSK Mapping 

For an efficient error correction with a convolutional code, a uniform distribution of 
channel bit errors (before the decoder) is necessary. A mobile radio channel produces burst 
errors, since many adjacent bits will be disturbed by one deep fade. For OFDM, this holds 
in time and in the frequency direction. To reach a more uniform distribution of badly 
received bits in the data stream before the decoder, the encoded bits will be spread over a 
larger time-frequency area before being passed to the physical channel. This procedure is 
called (time and frequency) interleaving. At the receiver, this spreading has to be inverted 
by the deinterleaver to restore the proper order of the bit stream before the decoder. 
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2.2.5 Time interleaving 

To spread the coded bits over a wider time span, a time interleaving will be applied for each 
sub-channel. It is based on a so-called convolutional interleaver. First, the codeword (i.e. 
the bits of one logical frame) will be split up into small groups of 16 bits. The bits with 
number 0 to 15 of each group will be permuted according to the bit reverse law (i.e. 0— >0, 
1— >>8, 2— >4, 3— >12,..., 14— >7, 15— >15). Then, in each group, bit no. 0 will be transmitted 
without delay, bit no. 1 will be transmitted with a delay of 24 ms, bit no. 2 will be 
transmitted with a delay of 2x24 ms, and so on, until bit no. 15 will be transmitted with a 
delay of 15x24 ms. At the receiver side, the deinterleaver works as follows. In each group 
bit no. 0 will be delayed by 15x24 ms, bit no. 1 will be belayed by 14x24 ms, and so on, bit 
no. 14 will be delayed by 24 ms, and bit number 15 will not be delayed. Afterwards, the bit 
reverse permutation will be inverted. Obviously, the deinterleaver restores the bit stream in 
the proper order, but the whole interleaving and deinterleaving procedure results in an 
overall decoding delay of 15x24 ms = 360 ms. This is a price that has to be paid for a better 
distribution of errors. A burst error on the physical channel will be broken up by the 
deinterleaver, because a long burst of adjacent (unreliable) bits before the deinterleaver will 
be broken up so that 2 bits of a burst have a distance of at least 16 after the deinterleaver 
and before the decoder. 

The time interleaving will only be applied to the data of the MSC. The FIC has to be 
decoded without delay and will therefore only be frequency interleaved. 



2.2. 6 DQPSK Modulation and Frequency Interleaving 

Because the fading amplitudes of adjacent OFDM sub-carriers are highly correlated, the 
modulated complex symbols will be interleaved. This will be done with the QPSK symbols 
before interleaving. We explain it by an example for TM II: a block of 768 encoded bits 
have to be mapped onto the 384 complex coefficients for one OFDM symbol of duration 
T s . The first 384 bits will be mapped to the real parts of the 384 QPSK symbols, the last 
384 bits will be mapped to the imaginary parts. To write it down formally, the bits of the /th 
block p tl (z = 0, 1,..., 2W-1) will be mapped to the QPSK symbols q u (i = 0,1,..., K-l) 
according to the rule 



<*, - [(l - - 2 PlJ )+ y (l - 2 p i+KJ )] , i = 0,1,...* - 1 ■ 



(2.7) 



The frequency interleaver is simply a renumbering of the QPSK symbols according to a 
fixed pseudo-random permutation F(i), as shown in Table 2.7. The QPSK symbols after 
renumbering are denoted by y k j (k = ± 1, ±2, ±3,..., ±KI 2). 



Table 2.7 Permutation for frequency interleaving (TM II) 

/ 0 1 2 3 4 5 ... 380 381 382 383 

k~F(i) -129 -14 -55 -76 163 141 ... -116 155 94 -187 
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The frequency interleaved QPSK symbols will be differentially modulated according to 
the law 

z k,i= z k,i-\-ykr ( 2 - 8 ) 

The complex numbers z kl are the Fourier coefficients of the OFDM symbol no. / in the 
frame. 



2.2. 7 Performance Considerations 

Sufficient interleaving is indispensable for a coded system in a mobile radio channel. Error 
bursts during deep fades will cause the Viterbi decoder to fail. OFDM is very well suited 
for coded transmission over fading channels because it allows time and frequency 
interleaving. Both interleaving mechanisms work together. An efficient interleaving 
requires some incoherency of the channel to get uncorrelated or weakly correlated errors at 
the input of the Viterbi decoder. This is in contrast to the requirement of the demodulation. 
A fast channel makes the time interleaving more efficient, but causes degradations due to 
fast phase fluctuations. The benefit of time interleaving is very small for fp mdiX < 40 Hz. On 

the other hand, this is already the upper limit for the DQPSK demodulation for TM I. For 
even lower Doppler frequencies corresponding to moderate or low car speeds and VHF 
transmission, the time interleaving does not help very much. In this case, the performance 
can be saved by an efficient frequency interleaving. Long echoes ensure efficient frequency 
interleaving. As a consequence, SFNs support the frequency interleaving mechanism. If, on 
the other hand, the channel is slowly and frequency flat fading, severe degradations may 
occur even for a seemingly sufficient reception power level. A more detailed discussion of 
these items can be found in the paper of (Schulze, 1995). 



2.3 The DAB Multiplex 

2.3.1 Mode-Independent Description of the Multiplex 

The DAB system is designed for broadcasting to mobiles in the frequency range from 30 
MHz to 3 GHz. This cannot be achieved by a single OFDM parameter set, so four different 
transmission modes are defined (see section 2.2.1). The DAB multiplex, however, can be 
described independently of the transmission mode. To achieve this, containers of 
information are defined which are used to transmit the data of applications (audio and data 
services, service information, etc.) to the receivers. Figure 2.8 shows the generation of the 
DAB multiplex. 

The data of audio components and other applications are carried in what is called the 
Main Service Channel (MSC). Every 24 ms the data of all applications are gathered in 
sequences, called Common Interleaved Frames (CIFs). Multiplex and service-related 
information is mainly carried in the Fast Information Channel (FIC). Similar to the MSC, 
FIC data are combined into Fast Information Blocks (FIBs). 
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Figure 2.8 Generation of the DAB multiplex 

Depending on the transmission mode, a number of CIFs and FIBs are grouped together 
into one transmission frame which is the mapped to a number of OFDM symbols (see 
section 2.2.2). 



2.3.2 The Main Service Channel 

The MSC of the DAB system has a gross capacity of 2.304 Mbit/s. Depending on the 
convolutional code rate, the net bit rate ranges from approximately 0.6 to 1.8 Mbit/s. Single 
applications do not normally consume this overall capacity. The MSC is therefore divided 
into sub-channels. Data carried in a sub-channel are convolutionally encoded and time 
interleaved. Figure 2.9 shows the conceptual multiplexing scheme of the DAB system. 

The code rate can differ from one application to another. The data rates available for 
individual sub-channels are given by integer multiples of 8 kbit/s (of 32 kbit/s for some 
protection schemes). Figure 2.10 shows an example for a multiplex configuration. Each 
sub-channel can be organised in stream mode or packet mode. 

The division of the MSC into sub-channels and their individual coding profiles are 
referred to as the Multiplex Configuration. The configuration is not fixed but may be 
different for different DAB transmissions or may vary from time to time for the same 
transmission. Therefore the multiplex configuration must be signalled to the receivers. This 
is done through the FIC. 
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Figure 2.9 Generation of the common interleaved frames (CIFs) from a) an audio sub-channel, 
b) a general data stream mode sub-channel, c) a packet mode sub-channel. 



36 kbit/s Data 




Figure 2.10 An example of how the overall capacity is assigned to individual applications (capacity 
pie). 
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2. 3. 3 Transport Mechanisms 

The DAB system provides several mechanisms to transport data to receivers which are each 
tailored to specific purposes. 

The Stream Mode is designed for continuous and synchronous data streams such as 
coded audio. In this case every 24 ms the same number of bits is transmitted transparently, 
that is, there is no mechanism inside the DAB system to provide synchronisation or 
addressing except by the frame structure. All the necessary signalling has to be provided by 
the application itself. Stream mode sub-channels may contain audio encoded according to 
ISO 1 1 172-3 Layer II or general data at fixed bit rate. 

For asynchronous data there is Packet Mode, which provides a protocol for conveying 
single data groups through a packetised channel. The packet protocol allows repetition of 
data to be handled and the creation of a multiplex of several parallel applications, to which 
the capacity can flexibly assigned. 

A special way of transport is provided for Programme-associated Data (PAD) which are 
inserted into the MPEG Layer II audio data stream by defining a structure of the Auxiliary 
Data field of the MPEG audio frame specific to DAB. It provides a number of functions 
related to the contents of the audio programme and can be inserted at the place where the 
audio is produced. Therefore PAD is considered to be a part of the audio and not really a 
separate transport mechanism for data. 

2.3.3. 1 Stream Mode 

Stream mode is used for applications which can provide a constant data rate of a multiple of 
8 kbit/s (32 kbit/s for the B coding profiles, see section 2.2.3). For example, at a sampling 
rate of 48 kbit/s, the MPEG Layer II audio encoder generates a data frame every 24 ms 
which exactly meets this requirement. When transmitting general data, the data stream can 
be divided into "logical frames" containing the data corresponding to a time interval of 24 
ms. These logical frames can be transmitted one after the other in the same manner as 
MPEG audio frames. 

When stream mode is used there are two options for error protection. Unequal error 
protection (UEP) is used with MPEG audio and provides error correction capabilities which 
are tailored to the sensitivity of the audio frame to bit errors (see section 2.2.3). For general 
data, equal error protection (EEP) is used, where all bits are protected in the same way. 

2.3.3. 2 Programme-associated Data 

Although DAB provides mechanisms to carry general data in packet mode, there is a need 
for an additional method for transmitting data which is closely linked to an audio service. 
This is referred to as PAD. 

At the end of the MPEG audio frame there is an auxiliary data field which is not further 
specified in MPEG. For the DAB audio frame, however, this field is used to carry the CRC 
for the scale factors (see Chapter 3), and two PAD fields, see Figure 2.11. The DAB system 
is transparent for the PAD. This means that the PAD can be produced and inserted at the 
time when the audio signal is coded, normally at the studio (see Chapters 5 and 6) and will 
be retrieved only when decoding the audio signal in the receiver. 

The fixed PAD (F-PAD) field enjoys the same protection level as the SCF-CRC field 
and hence is well protected. Therefore, it can be used to signal Dynamic Range Control 
(DRC, see section 5.4) and other control information to the receiver. 
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End of DAB audio frame 




Figure 2.11 The PAD fields at the end of the DAB audio frame 

The extended PAD (X-PAD) field can be used to send larger amounts of data (up to 64 
kbit/s), for example text messages or multimedia objects (see Chapter 4). To create a 
flexible multiplex structure within the PAD, a special packet structure was developed. The 
data are arranged in X-PAD-Data Groups which each consist of a data field and a 
Contents Indicator which signals the kind of data carried in the corresponding data field 
and in some cases the number of bytes of the data field. Data fields of several applications 
can be carried in parallel in the PAD section of one audio frame. 

The extended PAD (X-PAD) field partly enjoys the protection level of the SCF-CRC; 
the larger part, however, only enjoys the protection level of the sub-band samples. 
Therefore the contents indicators are separated from the data fields and collectively sent in 
the better protected part to ensure that the important information on the contents and 
amount of data can be received with high probability. 

2.3.3.3 Packet Mode 

The most general DAB transport mechanism is the packet mode structure. 

For packet mode transmission, the data is organised in data groups which consist of a 
header, a data field of up to 8191 bytes and optionally a cyclic redundancy check (CRC) 
for error detection. The data group header allows identification of different data group 
types such that, for instance, scrambled data and the parameters to access them can be 
carried in the same packet stream. There are also counters which signal if and how often 
the data group will be repeated. An extension field offers the possibility to address end 
user terminals or user groups. Data groups are transmitted in one or several packets. 

The packets themselves constitute the logical frames in packet mode similar to the 
audio frames in stream mode. To fit into the DAB structure requiring a multiple of 8 
kbit/s, that is a multiple of 24 bytes per 24 ms, packet lengths of 24, 48, 72 and 96 bytes 
are defined in the DAB standard. The packets consist of a 5 byte header, a data field, 
padding if necessary, and a CRC for error detection. The packets may be arranged in any 
order in the transmission. The header of each packet signals its length so that the decoder 
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can find the next packet. An important feature of packet mode is that padding packets can 
be inserted if no useful data are available to fill the data capacity. Packet mode is therefore 
suitable for carrying asynchronous data. Up to 1023 applications may be multiplexed in a 
packet mode transmission. 

Figure 2.12 shows an example of a packet mode transmission. Two applications are 
carried in parallel. From each application, a suitable amount of data, for example one file, is 
selected to form a data group. The corresponding header is added to the data and the CRC 
is calculated. Each data group is mapped on to a number of packets, each of which may 
have a different length. The first, intermediate and last packets carrying data of one data 
group are marked accordingly in their headers. The packets of different applications use 
different packet addresses. Therefore, the sequence of the packets belonging to the first 
application may be interrupted by those of the second application. However, within each 
sequence of packets, the transmission order has to be maintained to allow the decoder to 
correctly reassemble the data group. 

For more details of the data group and packet headers the reader is referred to (EN 
300401) and (TR 101496). 





Figure 2.12 Example of a packet mode transmission multiplexing two applications 
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2. 3. 4 Fast Information Channel 

The FIC is used to signal the multiplex configuration of the DAB transmission. Therefore, 
it uses fixed symbols in the transmission frame which are known to the receivers. The 
receiver needs this information to be able to decode any of the sub-channels. Therefore, for 
instant data acquisition, the FIC data are not time interleaved (hence the name ’’fast"). 
Instead, a high protection (code rate 1/3) and frequent repetition of the data is used to 
guarantee high availability. 

The FIC consists of a number of Fast Information Blocks (FIBs) which carry the 
information (see Figure 2.13). Each FIB is made up from 32 bytes: 2 CRC bytes are used 
for error detection, the remaining 30 bytes are filled with Fast Information Groups (FIGs) in 
which the Multiplex Configuration Information (MCI) and other information are coded. 
The FIGs are distinguished by a type field. FIGs of type 0 are used for the MCI and Service 
Information (SI), FIGs of type 1 are used to send text labels, FIGs of type 5 are used to send 
general data in the FIC (Fast Information Data Channel, FIDC), FIGs of type 6 are used 
with access control systems and FIGs of type 7 are used for in-house data transmission by 
the broadcaster. Some FIG types use extensions to indicate different meanings of the data 
fields, e.g. FIG type 0 extension 1 is used to signal the sub-channel organisation while FIG 
type 0 extension 10 is used to send the date and time. 

Fast Information Block (FIB), 32 bytes 

◄ ► 



FIG 1 


FIG 2 


■ 


FIG n 


Padding 


CRC 






■ 




(if neccessary) 






Header Data 

Type Length Extension 



Figure 2.13 Structure of a Fast Information Block (FIB) and a Fast Information Group (FIG) 

In special cases the data rate provided by the FIB symbols in the transmission frame (32 
kbit/s) may not be sufficient to carry all of the FIC information with the desired repetition 
rate. In this case, part of this information can be sent in parallel in packet mode sub-channel 
#63 with packet address 1023. This is referred to as the Auxiliary Information Channel 
(AIC). The corresponding packet mode data group contains the FIGs as defined for the FIC. 
Data concerning other DAB transmissions may be sent only in the AIC. For more details 
see section 2. 5. 2. 4. 
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2. 3. 5 Transmission Frames 

From FIBs and CIFs transmission frames are formed which are then mapped to OFDM 
symbols. Table 2.8 lists the duration of the frames and the number of FIBs and CIFs used in 
each transmission mode. 



Table 2.8 The frames of the four DAB transmission modes 



Mode 


Duration of 
Transmission Frame, 
ms 


Number of CIFs per 
Transmission Frame 


Number of FIBs per 
Transmission Frame 


TMI 


96 


4 


12 


TM II 


24 


1 


4 


TM III 


24 


1 


3 


TM IV 


48 


2 


6 



2. 3. 6 The logical structure of the DAB Multiplex 

The different data streams carried in the DAB multiplex can be grouped together to form 
what is called a Service. The services can be labelled (e.g. "Bayern 3") and it is the services 
from which the listener makes his or her choice. All services taken together are referred to 
as an Ensemble. 

The different data streams (e.g. audio, data, etc.) which belong to one service are called 
its Service Components. Different services may share components, and the logical structure 
of services, service components and the position in the CIF where the data of the 
components are actually carried is signalled as part of the MCI in the FIC. Figure 2.14 
shows an example of this structure. The ensemble labelled DAB 1 contains a number of 
services, among them Radio 11, Radio 12, Data 29 and Radio 3. Radio 3 is a service which 
consists of an audio component only, that is a "normal'* radio programme. Radio 1 1 and 
Radio 12 each consist of an audio component and share Radio 1 data as a common data 
component which carries information relevant to both programmes or programme- 
independent data such as traffic messages. At times, say during a news programme on the 
hour, Radio 11 and Radio 12 broadcast the same audio signal. Instead of transmitting the 
corresponding bits twice, it is possible to signal this at the level of service components. The 
capacity in the CIF usually used for the audio component of Radio 12 (i.e. Sub-ch. 7) can 
be reused during this time for another service or component, for example slow motion 
video. Data 29 is a data service without an audio component but with two separate data 
components. Figure 2.14 also shows how the different components are transported in the 
MSC. It can be seen that different components (Radio 1 data and Data 29 data 2) may share 
a packet mode sub-channel while stream mode components each require an individual sub- 
channel. 
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ENSEMBLE 



SERVICES 



SERVICE 

COMPONENTS 



COMMON 

INTERLEAVED 

FRAME 



Figure 2.14 Example of the logical structure of the DAB multiplex 

It is also apparent in the figure which information has to be sent in the FIC: the sizes 
and position of the sub-channels in the CIF and their respective code rates are signalled as 
“Sub-channel organisation” in FIG type 0 extension 1, the services and their components 
are described by "Basic service and service component definition" in FIG type 0 extension 
2 and 3 (used for packet mode). 

Of course there is a need to provide further information about each service, such as 
service label, programme type, programme number for recorder control, indication of 
announcements (e.g. for automatically switching to a service broadcasting traffic 
information), alternative frequencies and frequencies of other DAB transmissions. This 
kind of information may be displayed in order to assist the listener in operating the receiver 
or to enable the receiver to optimise the reception. The DAB system provides a number of 
sophisticated mechanisms for this purpose, far beyond what has been realised by the Radio 
Data System (RDS) in FM broadcasting (see section 2.5). 



2. 3 . 7 Multiplex Reconfiguration 

To achieve a high flexibility, the multiplex may be reconfigured from time to time during 
transmission. When the multiplex configuration is about to change, the new information, 
together with the timing of the change, is transported via the MCI and indicates in advance 
what changes of the multiplex are going to take place. Because of the time interleaving, 
special precaution has to be taken when the size of a sub-channel changes. For details the 
reader is referred to Annex D of (EN 300401). 
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2.4 Conditional Access 

Access control may be an important feature especially for data applications on DAB. But 
also scrambled audio services could be required in some broadcasting environment. To 
serve these needs, an access control mechanism is included in the DAB specifications. 

The access control system consists of three major parts: 

• scrambling / descrambling, 

• entitlement checking, 

• entitlement management. 

By using these means, a number of features can be realised, such as subscription per 
theme, level or class, pay per programme, per service or time. The only part of the system 
which is specified in the DAB standard (EN 300401) is the scrambling and descrambling 
process. For entitlement checking and management different systems, including proprietary 
ones, may be used. 



2. 4. 1 Scrambling and Descrambling the Bits 

In the scrambling system which is used by DAB, the individual bits are scrambled by 
adding (using the logical AND function, or adding "modulo 2") a pseudo-random binary 
sequence (PRBS) to the data. The PRBS is derived from an Initialisation Word (IW) fed 
into a PRBS generator, which consists of a number of coupled shift-register stages and is 
often implemented as an integrated circuit. The PRBS generator used in DAB is the same 
as the one used for scrambling TV signals and is specified in (EN 300174). 

A receiver with the PRBS generator implemented can therefore descramble the received 
signal and hence access the service if it can generate the correct IWs at all times. The IWs 
change very often to restart the PRBS generator. This avoids long delays when the service 
is accessed and ensures that only short PRBSs are generated from which the IW cannot be 
derived. This is important for broadcast applications because the PRBS generator is 
publicly known and hence a pirate could acquire an entitlement and calculate the PRBS by 
comparing the scrambled and unscrambled signals. 

To allow for frequently changing IWs they are generated from two sources: the Control 
Word (CW), which is provided by the entitlement checking entity (see below), and the 
Initialisation Modifier, which is either generated locally from the DAB frame count or 
transmitted along with the data in plain (i.e. unscrambled). 

In stream mode (audio and data), the complete stream is scrambled, and the PRBS 
generator is initialised at the beginning of each logical frame. In packet mode and in the 
FIDC only the data fields of the packets or FIGs are scrambled, not the headers and CRCs. 
In packet mode it is also possible to scramble an entire data group before distributing the 
data to different packets. 





46 



Digital Audio Broadcasting: Principles and Applications 



2. 4.2 Checking Entitlements 

To be able to check if a receiver is entitled to unscramble a service, the service provider 
sends an Entitlement Checking Message (ECM) to the access control systems in the 
receivers. This message contains the CW scrambled by a second security system, which is 
kept secret and in which the different access control systems differ, and the conditions 
which the user has to fulfil to be able to unscramble the CW. In the access control system in 
the receiver these parameters are stored, for example on a smart card which is inserted in 
the receiver and which the user has purchased and paid for. The ECM is descrambled in the 
access control system and the parameters sent by the service provider are compared with 
those stored locally. If they match, the CW and hence the IW can be calculated. 

Depending on the transport mechanisms, several ways of transmitting the ECMs are 
possible. 

In stream mode (audio or data) the ECMs have to be sent in a separate channel because 
there is no means of multiplexing the scrambled data and the ECMs inside the sub-channel. 
The ECMs are therefore transported in the FIC (FIG type 6) or in sub-channel #63 using 
packet mode. 

In packet mode, it is possible to send both the scrambled data and the ECMs in the same 
sub-channel, because they can be distinguished by their data group headers. This makes it 
possible to store a service component sent in packet mode (e.g. on a computer disk) and 
descramble off-line using software only instead of a PRBS circuit. 



2. 4. 3 Entitlement Management 

The entitlement management function makes it possible to update the access parameters 
stored in the receivers. This may be used to extend a subscription over the air without the 
need to send a new smart card to the subscriber. 

For this purpose the service provider sends Entitlement Management Messages (EMMs) 
to individual decoders, which therefore need to have a unique address. For certain purposes 
it is also necessary to address groups of decoders or all decoders. The EMMs are also 
carried in packet mode data groups or in the FIC. 



2.5 Service Information 

2.5.1 Introduction 

Service Information (SI) provides additional information about the services carried in an 
ensemble and is intended to simplify service access and to provide attractive receiver 
features and functionality. Similar information carried in other ensembles and/or in AM/FM 
services may also be signalled. Unlike the MCI (Multiplex Configuration Information), SI 
features are optional, that is broadcasters and receiver manufacturers may choose which to 
implement and when. 
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2. 5 . 2 Basic Information 

2.5.2. 1 Ensemble Label 

The listener basically finds a programme by selecting a service in an ensemble. 

The ensemble label coded in FIG type 1 Extension 0 contains the Ensemble Identifier 
(Eld) and a character field formed by a string of 16 characters which provide the listener 
with a clear textual description for the ensemble. A character flag field contains a flag for 
each character in the ensemble label. When a bit is set to “1”, the corresponding character 
of the ensemble label is included in an abbreviated label for display on receivers having a 
display of less than 16 characters. 

Example: 

Ensemble label 16 characters 
Character flag field : 

Ensemble label 8 characters : 

2.5.2.2 Date and Time 

The date and time information in DAB is encoded in MJD (Modified Julian Date) and UTC 
(Co-ordinated Universal Time). The MJD is a continuous count of the number of days 
elapsed since 17 November 1858. The UTC is a 24 hour atomic time system that forms the 
basis of most radio time systems. 

The data and time feature is encoded in FIG type 0 extension 10 which contains besides 
the MJD and the UTC also a leap second indicator (LSI) and a confidence indicator. The 
ensemble provider should set the leap second indicator to “1” throughout a UTC day that 
contains a leap second. The confidence indicator is set to “1” when the time information is 
within an agreed tolerance. The MJD is coded as a 17-bit binary number that increments 
daily at 0000 (UTC) and extends over the range 0-99999. The following formula allows the 
MJD to be calculated from the year (Y), month (M) and date (D): 

MJD = 14956 + D + int [ (Y-L) • 365.25 + int [ M + 1 + L*12)*30.6001 ]. For January 
and February (i.e. for M = 1 or M = 2) variable L = 1, otherwise L = 0. 

Example : 10 October 2004 -> MJD = 14956 + 10 + int [ (104-0)* 365.25 + int [ 10 + 1 + 
0T2)*30.6001 ] = 69715 = 0010001000100000101001 1 =0x11 1053. 

FIG 0/10 allows the time information to be provided at two different resolution levels. 
The standard resolution allows the time to be expressed in 1 minute intervals (short form of 
FIG 0/10) whilst the high-resolution level allows timing to be expressed in millisecond 
intervals (long form of FIG 0/10). Five bits are used to represent the hour (in the range 0 - 
23) and 6 bits are used to represent the minutes. The long-form version contains 
additionally a 6-bit “seconds” and a 10-bit “milliseconds” sub-field. 

Example: 10 October 2004 at 14h 45min 25sec 50 ms: 



BBC_National 

0011111110000100 
BBC Natl 



MJD 


Hours 


Minutes 


Seconds 


Milliseconds 


00100010001000001010011 


OHIO 


101101 


0110001 


0000110010 


(69715= 10 Oct 2004) 


(14) 


(45) 


(25) 


(50) 
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2. 5.2.3 Country, Local Time Offset, International Table 

A number of basic, mostly static, ensemble parameters are encoded in FIG type 0 extension 
9 (FIG 0/9): 

• The Ensemble ECC (Extended Country Code) specifies together with the country code 
of the Eld a world-wide unique ensemble identifier. In an international ensemble it is 
possible to assign different ECCs to services. This is done in the extended field. The 
ECC is crucial in case of PTy preview or PTy downloading (see section 2. 5.4. 2) and 
also for TMC (see Chapter 4). 

• The Ensemble LTO defines the local time offset of the ensemble from the UTC 
transmitted in FIG type 0 extension 10 (see section 2. 5. 2. 2). It is expressed in multiples 
of half an hour (in the range -12 hours to +12 hours). In an international ensemble it is 
possible that some services use a different LTO. This can be signalled in the extended 
field. 

• The Inter. Table Id indicates which international table is applicable in the ensemble. 
Two international tables are defined: 

- Table 00000001 comprises the European table of RDS PTY codes (except for codes 

30 and 31) as specified in CENELEC EN 50067 and the table of announcement 
types specified in ETSI EN 300401. 

- Table 00000010 comprises the RBDS list of PTY codes as specified in the RBDS 

standard and the table of announcement types specified in ETSI EN 300401. 

The coding allows the definition of future table identifiers, for example for use in Asian or 
African countries where there might be a need for another list of PTys and/or another list of 
announcement types. The use of different international tables in one ensemble is not 
allowed. In Europe only Table 00000001 is applicable. 

2.5.2.4 FIC Redirection 

When there is a risk for overload in the FIC some FIGs may be transferred either as a whole 
or in part to the Auxiliary Information Channel (AIC). The AIC uses packet mode 
transport. FIC redirection is encoded in FIG type 0 extension 3 1 . The FIG type 0 flag field 
and the FIG type 1 flag field indicate which FIGs of type 0 resp. type 1 are carried in the 
AIC. All MCI FIGs should remain in the FIC. FIGs applying to other ensembles may be 
entirely transferred to the AIC. Also FIG type 1 extension 2 (PTy downloading) and FIG 
type 0 extension 30 (satellite database) may be entirely transferred to the AIC. Other FIGs 
may be present partly in the FIC and partly in the AIC. For some of them the information is 
repeated more frequently in the FIC than the AIC or vice-versa. This ensures that those 
receivers not equipped with a packet mode decoder will be able to access the basic data 
features though their performance may be inferior to other receivers. 



2. 5. 3 Service-related Information 

2. 5.3.1 Programme Service Label and Data Service Label 

The service label feature comprises a character field forming a string of up to 16 characters 
and provides the listener with a textual description of the service name. The characters are 
coded from byte 15 to byte 0. The first character starts at byte 15. The character field is 
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linked to a machine-readable Service Identifier (Sid). In the case of a programme service 
label the Sid comprises 16 bits and is similar in format to the RDS Programme 
Identification (PI) code: the 4 MSBs form the Country Id which distinguishes between 
fifteen countries while the remaining 12 bits are the service reference. In the case of a data 
service label the Sid comprises 32 bits: the 8 MSBs represent the service ECC, the next 4 
bits the Country Id and the remaining 20 bits the service reference. The character flag field 
allows identification of up to eight characters from the service label to suit receivers with a 
short display. If the character field contains less than 16 characters, the unused bits in the 
character flag field (having no corresponding character) are set to zero as shown in Table 
2.9. 

Table 2.9 Example of service labels 





Programme Service Label 


Data Service Label 


FIG 


FIG 1/1 


FIG 1/5 


SId 


16 bits 


32 bits 




4 bits Country Id 


8 bits service ECC 




12 bits service reference 


4 bits Country Id 
20 bits service Reference 


Example of service label: 
16 characters 


BBC_Rl_Digital__ 


RADIO_l_TMC 


character flag field 


01111111100000000 


on oooooin 1 1 ioo 


Service label 8 characters 


_BBC_R1_ 


_ R1_TMC_ 



Whereas the ensemble label (see section 2.5.2. 1) is expected to change very rarely, if ever, 
the name of the service will probably change only when the programme is continued by a 
different service provider, which may happen a few times per day, if ever. 

2.5.3. 2 Service Component Label 

Service component labels are useful for the user to distinguish between service components 
of the same type. All service components are accessed at the service level, which means 
that service components can only be requested after their parent service has been selected. 
The receiver is required to support an additional layer of service access: firstly to offer the 
services and secondly to offer the components of a chosen service. The service component 
label is encoded in FIG type 1 extension 4 in terms of the SCIds, which identifies the 
service component within the service. 

Service component labels may change frequently. It is recommended that the receiver 
follows any changes to these labels to avoid obsolete information remaining in the display. 



Example: Service “RADIO 1” with Sid = F123 (hex) and service label “RADIO 1” has in 
addition to the primary service component two secondary (audio) service components 
labelled as “RADIO 1 EXTRA” and “RADIO 1 PLUS”: 





50 



Digital Audio Broadcasting: Principles and Applications 



Service 


Service Label 


SCIdS 


Service Component Label 


F123 


RADIO 1 




RADIO _1_ EXTRA 








RADIO_l_ PLUS 



2. 5.3.3 Service Component Trigger 

The beginning or the end of the transmission of a service component can be signalled by 
means of the Service Component Trigger (SCT) feature. It is primarily used with data 
services or multimedia services. In the case of receivers processing the FIC or the AIC in a 
continuous way, the SCT may be transmitted in the FIC or in the AIC at any time in order 
to signal the beginning or end of the transmission of a service component. The same SCT 
may be transmitted several times without any constraint on the transmission time. When a 
receiver is a member of a service user group and it satisfies the access conditions, it shall 
look for the service carried in the MSC in stream or packet mode or in the FIDC, at the time 
indicated by the SCT. Since there may be a delay between the real time of the reception of 
the service (component) and the time indicated in the SCT the SCT should be transmitted 
by enough time in advance. For the start of the service, the actual beginning of the 
reception of the service (component) shall be later than the time indicated in the SCT. For 
the end of the service (component), when signalled, the end time indicated shall be later 
than the actual end time. 

The SCT feature is encoded in FIG type 0 extension 20 (FIG 0/20) which comprises the 
service component description in terms of the service component type (ASCTy or DSCTy). 
This FIG also signals where the component is located: in the FIDC or in the MSC in stream 
mode (short form of FIG 0/20) or in the MSC in packet mode (long form of FIG 0/20). FIG 
0/20 further contains a number of flags. One of the flags signals whether the Logical Frame 
Number (LFN) applies to the beginning or end of service transmission. Another flag is used 
to signal whether the optional service user group information (defining specific groups of 
receivers) is present or not. Additional information may be provided to signal the time 
(UTC in 1 minute resolution) and conditional access parameters (CAId, SCCA). 

If the time value is set to “0”, or is absent, and if LFN = “1FFF”, the transmission of the 
indicated service (component) is considered to have just started or ended. 

2. 5.3.4 Programme Number 

The Programme Number (PNum) is a code that enables receivers and recorders designed to 
make use of this feature to respond to the particular programme item(s) that the user has 
pre-selected. In a receiver PNum may be implemented in a similar way as PTy in WATCH 
mode (see also section 2. 5. 4. 2). However, whereas PTy WATCH is used to switch on or 
record a particular genre of programme, PNum allows a specific programme to be switched 
on or recorded. 

PNum uses the scheduled programme time of the programme, to which the day of the 
month is added in order to avoid ambiguity. PNum together with the Sid of the service that 
has scheduled the programme item form a unique identifier for this programme item. PNum 
is similar to the 16-bit PIN (Programme Item Number) in RDS. The first 5 MSBs (b 1 5 to 
bll) represent the day of the month, in the range 00-31, the next 5 bits (blO to b6) the 
hour, in the range 00-23, and the final 6 bits (b5 to bO) the minute, in the range 00-59. 
When the date part (b 1 5 to bl 1) of PNum - 00000 special codes are allowed for the hours 
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and minutes to signal a status code, a blank code or an interrupt code as shown in Table 

2 . 10 . 

PNum is encoded in FIG type 0 extension 16 where it is associated with the Sid to 
provide a unique reference for each programme. The Continuation flag indicates whether or 
not there will be a planned interruption while the Update flag indicates a re-direction to a 
different service (New Sid) and time (New PNum). New Sid thus indicates the Sid of the 
target service of the redirection while New PNum specifies the new emission time or the 
time at which the programme will be resumed. It is coded in the same way as PNum. 



Table 2.10 Special PNum codes 



Date 

bl5-bll 


Hours 

b!0-b6 


Minutes 

b5-b0 


Description 


Meaning 


00000 


00000 


000000 


Status code 


no meaningful PNum is provided 


00000 


00000 


000001 


Blank code 


not worth recording (e.g. a non-scheduled 
programme during an intermission) 


00000 


00000 


000010 


Interrupt code 


unplanned interruption of a programme, 
which will continue after a short interval 
(e.g. during a traffic announcement) 



Example 1: Redirection of a programme item 

Suppose Radio 1 (Sid = FI 23) has scheduled on the 18th of the month, at 16.25h, a 
programme item called “Festival d’ Avignon”. At 17.00h the news is scheduled so that 
“Festival d’Avignon” is continued on Radio 3 (Sid = F456) where it replaces a scheduled 
programme item “Today in Paris” which is deleted. 

The PNum for “Festival d’ Avignon” has D = 18 = 10010; H = 16 = 10000; M = 25 = 
011001 so that PNum = 1001 0100 0001 1001 =0x9419. 

At 16.25h the following FIG 0/16 is transmitted: 



SId 


PNum = 9419 


Cont 


Upd 


Date 


Hour 


Min 


F123 


18 


16 


25 


0 


0 



Receivers programmed to start recording “Festival d’ Avignon” on Radio 1 can start 
recording this programme item. These receivers have to be informed that this programme 
item will continue at 17.00h on Radio 3. Since “Festival d’ Avignon” is redirected to service 
F456 at 16.59h, the parameters for the New PNum are: D = 18 = 10010; H = 16 = 10000; 
M = 59= 111011 so that New PNum = 1001 0100 0011 1011 =0x943B. 

At 16.57h the following FIG 0/16 is transmitted during at least the last minute of the 
programme: 
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Sid 


PNum = 9419 


Cent 


Upd 


New 

Sid 


New PNum = 943B 


Date 


Hour 


Min 


Date 


Hour 


Min 


F123 


18 


16 


25 


1 


1 


F456 


18 


16 


59 



The receiver is now informed that the programme item with PNum = 9419 (D18-H16- 
M25), namely “Festival d’ Avignon”, continues on F456 (Radio 3) under a new PNum = 
943 B (D18-H16-M25). However, on Radio 3 a programme item was scheduled at 17.00h 
with PNum = 9440 (D18-H 17-MOO) called “Today in Paris”. This programme will be 
deleted. 

When a programme is deleted, the original Sid and PNum codes are transmitted with 
the Update flag set to “1” and with New PNum = blank code. When a new programme item 
replaces the original scheduled programme item the PNum code for the new programme 
item is set to a new unique start time which is equal to the nominal start time of the original 
programme item minus 1 minute. 

Therefore at 17.00h, the following two FIGs 0/16 are transmitted in parallel: 



Sid 


PNum = 943B 


Cont 


Upd 


Date 


Hour 


Min 


F456 


18 


16 


59 


0 


0 



Sid 


PNum = 9440 


Cont 


Upd 


New 

Sid 


New PNum = 0001 


Date 


Hour 


Min 


Date 


Hour 


Min 


F456 


18 


17 


00 


0 


1 


F456 


00 


00 


01 



With the first FIG 0/16 receivers are informed that at 17.00h on F456 (Radio 3) a (new) 
programme item with PNum = 943B (D18-H16-M59), that is “Festival d’ Avignon”, 
replaces the original scheduled programme item. 

The second FIG 0/16 signals that the originally scheduled programme item with PNum 
= 9440 (D18-H1 7-MOO), that is “Today in Paris”, is deleted. The New PNum in this FIG is 
a blank code indicating that it is not worth recording this programme item. 

This second FIG 0/16 (which is interleaved with the first FIG 0/16) is transmitted until 
at least 17.01h. 

Example 2: Unplanned interruption 

During a traffic announcement (i.e. an unplanned interruption) the Update and Continuation 
flags are set to “0”. Suppose Radio 1 (Sid = FI 23) has scheduled on the 18th of the month 
at 9.15h a programme entitled “The Renaissance” and at 9.30h a programme item entitled 
“Festival d’Avignon”. 

At 9.15h a first FIG 0/16 is transmitted (1) with PNum = 924F. Receivers are informed 
that this is the PNum of “The Renaissance”. Receivers programmed to record this 
programme item can start recording. At 9. 1 8h there is a traffic announcement. A recorder 
recording “The Renaissance” should be warned to stop the recording temporarily. 
Therefore a new FIG 0/16 is transmitted (2) containing the interrupt code (PNum = 0002). 
When the traffic announcement is over, the same FIG 0/16 is transmitted (3) as in case (1) 
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to allow the receiver to resume recording of “The Renaissance”. At 9.30h programme item 
“Festival d’Avignon” starts. A new FIG 0/16 is transmitted (4) to allow receivers 
programmed to record this item to start recording while receivers programmed to record 
“The Renaissance” will stop recording. 



FIG 

0/16 


SId 


Date 


Hour 


Min 


PNum 


Cont 


Upd 


Remarks 


(i) 


F123 


18 


09 


15 


924F 


0 


0 


“The Renaissance” 


(2) 


F123 


00 


00 


02 


0002 


0 


0 


“Traffic announcement” 


(3) 


F123 


18 


09 


15 


924F 


0 


0 


“The Renaissance” 


(4) 


F123 


18 


09 


30 


925E 


0 


0 


“Festival d’Avignon” 



Example 3: Delay of a programme item 

If the start of a programme item is delayed due to the extension of a previous programme 
item a FIG 0/16 with Update flag = “1” and containing a New PNum with the new start 
time is transmitted. Suppose Radio 1 (Sid = FI 23) has scheduled on 18th May, at 15.05h, a 
programme item called “Roland Garros” with PNum (D18-H15-M05) = 93C5 and at 
15.45h a programme item called “Today in Paris” with PNum = D18-H15-M45 = 93ED. 
Because of the extension of “Roland Garros” the start of “Today in Paris” is postponed by 
10 minutes until 15.55h. 

At 15.45h the following FIG 0/16 is transmitted: 



SId 


PNum = 93C5 


Cont 


Upd 


Date 


Hour 


Min 


F123 


18 


15 


05 


0 


0 



This FIG 0/16 informs the receiver that at 15.45h not the expected “Today in Paris” but a 
programme item with PNum = 93C5, i.e. “Roland Garros”, is (still) on air. In other words 
“Roland Garros” is extended until after 15.45h. Receivers programmed to record “Today in 
Paris” with PNum = 93ED should be informed that this programme item is postponed until 
15.55h. Therefore another FIG 0/16 having the update flag = 1 and containing as New 
PNum = D18-H15-M55 = 93F5 (indicating the new start time of “Today in Paris”) is 
interleaved with the former FIG 0/16. 



SId 


PNum = 93ED 


Cont 


Upd 


New 

SId 


New PNum = 93F7 


Date 


Hour 


Min 


Date 


Hour 


Min 


F123 


18 


15 


45 


0 


1 


F123 


18 


15 


55 



Receivers programmed to record “Today in Paris” will wait until 15.55h to start recording. 
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2. 5. 4 Programme-related Features 

2. 5.4.1 Service Component Language 

The Service Component Language (SC Lang) feature signals the language associated with 
the content of a service component. It is used for service (component) selection based on 
language rather than the Sid, which is required for conventional service selection. 

SC Lang serves two purposes: 

1. In the case of audio service components, SC Lang is used to identify the spoken 
language associated with the audio content of a service component. SC Lang is required 
whenever a programme service has more than one secondary service component in a 
different language. If a service has no secondary service components (only a primary 
service component) then PTy language in FIG 0/17 (see 2. 5. 4. 2) is used to signal the 
language of the programme. In this case SC Lang provides complementary redundant 
information that can be used for confirmation purposes. 

2. In the case of data service components, SC Lang is used to identify the language 
associated with the content of the data. It allows the user selection of data service 
components by dint of language. 

SC Lang is encoded in FIG type 0 extension 5 (FIG 0/5) where the language is associated 
with either the Subchid, the FIDCId or the SCId depending on whether the component is 
carried in the MSC in stream mode, in the FIC or in the MSC in packet mode. In the first 
case the short form of FIG 0/5 is used (L/S flag = “0”), in the latter case the long form is 
used (L/S flag = “1”). The MSC/FIC flag indicates whether, in the short form, the 
component is carried in the MSC in stream mode or in the FIC. 

In the following example the SC Lang for a service component carried in the MSC in 
stream mode is Swedish in sub-channel “3” and Finnish in sub-channel “4”. 



L/S 


MSC/FIC flag 


Subchid 


Language 


0 


0 (MSC) 


000011 (3) 


00101000 (Ox 28) = Swedish 


0 


0 (MSC) 


000100 (4) 


00100111 (Ox 27) = Finnish 



SC language versus PTy language 

It is important to notice the subtle difference between SC language and PTy language. SC 
Lang can be used to signal the language of any service component within an ensemble, 
being primary or secondary, audio or data. PTy language is intended to be used in 
combination with the programme types and it signals the language of the primary service 
component of a service. In addition, one language can be signalled for the secondary 
service component(s). Whereas SC language cannot be used to signal the language of 
service components in another ensemble, PTy language can be cross-referenced to another 
ensemble. 

Whenever both SC language and PTy language are implemented their language fields 
should be consistent, that is, SC language should be equal to PTy language. If all services 
within an ensemble are in the same language, there is basically no need for signalling SC 
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language. However, its use is strongly recommended because some receivers may have 
implemented a language filter based on SC language. In the case of static PTy language, 
the SC language should also be static. When the service provider has a service with 
programmes targeted to different language audiences then it should signal the language 
dynamically. If it has no means for dynamic signalling, static signalling is allowed on 
condition that the PTy language is set equal to “0”. SC language in this case is not 
transmitted. 

2.5.4. 2 Programme Type 

The PTy feature provides another means of service access based on the categorisation of 
programme content by the service provider and the transmission of codes representing these 
categories to the receiver. The PTy classification in DAB has been greatly extended beyond 
the 32 potential categories available with RDS. More than one code can be assigned at a 
time. This reduces the number of times that code “0” (no PTy) need be used and allows 
programmes to be more accurately described. 

Coarse and Fine PTy codes 

There are two levels of PTy categorisation referred to as coarse and fine. Besides the 30 
categories which are available from the international table (and which provide full 
compatibility with RDS in Europe or RBDS in the United States), a further table of 32 
additional coarse codes (32-64) is available to extend the categories of the first table. 

Examples of possible coarse codes: Opera, Musical, Agriculture, Politics, Tourist 
interests, Consumer interests, Elderly interests, Women’s interests, Disabled interests, 
Minority/Ethnic, Foreign language, Environmental, Wildlife/Nature, etc. 

A further set of up to 256 fine codes is available for finer sub-divisions of some of the 
categories defined in the combined pair of tables of coarse codes. Fine codes not only 
provide a more detailed description of the programme but also allow a more efficient 
WATCH mode, especially in the case of PTy preview. It makes more sense to look for 
TENNIS than for SPORT or to look for VIOLIN CONCERTO than for CLASSICS. Table 
2.1 1 shows examples of possible fine codes (together with the associated international code 
or coarse code). Each fine code may be associated with only one course code. 

Table 2.11 Examples of possible PTy fine codes 



Int. code or 
Coarse code 


Possible PTy Fine Codes 


SPORT 


Athletics, Winter sports, Water sports, Equestrian sports, Martial 
arts, Non-instrumental ball games, Instrumental ball games, 
Football, Tennis, Wimbledon, Cricket, Basketball, Hockey, 
Formula 1, Baseball, Skiing, etc. 


CLASSICS 


Baroque music, Chamber music, Contemporary music, Vocal & 
choral music, Ballet music, Live concert, Recital, Cantata, 
Symphony, Piano concerto, Mozart, etc. 


POP 


Reggae, Beat, Techno, House, Disco, Rapp, Funk, The Sixties, etc. 


JAZZ 


Blues, Ragtime, New Orleans, Dixieland, Spiritual, Swing, etc. 


OPERA 


Grand Opera, Comic Opera, Belcanto, Oratorio, Operetta, 
Zarzuela, Singspiel, etc. 
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It is expected that the several coarse codes will be agreed internationally. Others will be 
defined only at a national level. Fine codes may be different for each country but some of 
them may also be agreed internationally. 

For coarse codes in the range 32 to 63 and all fine codes, the PTy downloading 
mechanism (see section 2. 5. 4. 3) provides the means for their “over-air” definition by 
linking these codes to a displayable label. 



Static and Dynamic PTy codes 

Service providers may wish to implement the PTy feature in different ways. The first way 
minimises the expense of the studio infrastructure by keeping the codes relatively static. 
This approach serves to reflect the general service flavour, that is PTy codes (as well as the 
language, if signalled) are assigned to a service and remain unchanged for a long period and 
may be used to SEARCH for specific programme categories. It is obvious that in this case 
the PTy may not reflect the actual on-air programme but rather the format or genre of the 
service as a whole. Secondly, the PTy feature may be operated in a dynamic mode. In this 
case the code changes at programme item junctions for example from NEWS to SPORT or 
from NEWS in SWEDISH to NEWS in FINNISH. Dynamic PTy allows the receiver to 
operate in the WATCH mode as well as in SEARCH or SELECTION mode. Theoretically 
up to eight PTy codes can be allocated to a programme: 1 static Int.code, 1 static coarse 
code, 2 static fine codes, 1 dynamic Int.code, 1 dynamic coarse code, 2 dynamic fine codes. 



The Int.code, Coarse code and Fine code fields 

PTy is encoded in FIG type 0 extension 17 (FIG 0/17) where the Sid of a service is linked 
to a language field, an Int.code, one coarse code and up to two fine codes. 

Example: Suppose there is an international agreement for the following coarse codes: 32 = 
OPERA, 33 = AGRICULTURE; and for the following fine codes: 108 = TENNIS and 176 
= FLAMENCO MUSIC. 

Signalling code “14” (CLASSICS) in the Int.code field of FIG 0/17 and code “32” in 
the coarse code field can signal an OPERA. The fine code field can contain a nationally 
agreed fine code, for example “147” = “Salzburger Festspiele“. 

A programme on AGRICULTURE can be signalled by putting value “02” (current 
affairs) or “03” (information) or “00” (no programme type) in the Int.code field of FIG 0/17 
and value “33” in the coarse code field. 

A report from a TENNIS match can be signalled by putting value “04” (SPORT) in the 
Int.code field and value “108” in the fine code field. In this case the coarse code field is not 
used but a second (nationally agreed) fine code could be used, for example, 124 = 
ROLAND GARROS. 

Putting code “26” (National music) in the Int.code field and code “176” in the fine code 
field can signal a programme with FLAMENCO MUSIC. Also in this case the coarse code 
field can but need not be used. It would for example be possible to put code “15” (Other 
music) in the Int. code field and code 26 (national music) in the coarse code field (or vice 
versa) as shown in the example below. 
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SId 


S/D 


Language 


Int.code 


Coarse Code 


Fine Code 


A301 


1 


00001000 


01110(14) 


100000 (32) 


10010011 (147) 






German 


Classics 


Opera 


Salzburger Festspiele 


8217 


1 


00011101 


00011 (03) 


100001 (33) 


Absent 






Dutch 


Information 


Agriculture 




F238 


1 


00001111 


00100 (04) 


Absent 


01 101 100 (108) Tennis 






French 


Sport 




01111100(124) Roland Garros 


E265 


1 


00001010 


01111 (15) 


11010(26) 


10110000(176) 






Spanish 


Other music 


Nat. music 


Flamenco music 



PTy of primary and secondary service components 

The PTy categorisation applies to the primary audio service component of a service. 
Secondary service components, if available, cannot be individually categorised. They 
should be of the same “type” as the primary component but they may or may not be given 
in the same language. One language code can be assigned to a service, regardless of how 
many secondary components there are. The language therefore applies to the primary 
component, which is signalled by the P/S flag in FIG 0/17 being set to “primary”. There is 
one situation where the language of a secondary component can be signalled. This is when 
the service comprises only two audio components, one primary and one secondary, and 
these two components have different languages. 



Example; Suppose a Swedish service E456 has on-air a news programme in Swedish (as 
primary SC) and in Finnish (as secondary SC). This can be signalled by means of two FIGs 
0/17: 



SId 


S/D flag 


P/S flag 


L flag 


CC flag 


Language 


Int.code 


E456 


1 (dynamic) 


0 (primary) 


1 


0 


00101000 

Swedish 


00001 

News 


E456 


1 (dynamic) 


1 (secondary) 


1 


0 


00100111 

Finnish 


00001 

News 



It is important to notice that when a service comprises secondary service components a 
SEARCH or WATCH will basically yield the primary service component. 

Example: Service FI 23 with PTy code = SPORT has a primary service component 
transmitting a report of a TENNIS match and at the same time a secondary service 
component transmitting a report of a FOOTBALL match. A SEEK or a WATCH for 
SPORT will yield the report of the TENNIS match. If a service contains one (and not more 
than one) secondary service component in a different language a WATCH for the 
(common) PTy will yield the component in the desired language if the receiver has a 
language filter based on PTy language. In the example above of the Swedish service E456, 
a WATCH for NEWS in FINNISH will directly yield the Finnish news on the secondary 
service component of that service. 
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2.5.4.3 Programme Type Downloading 

The PTy downloading feature allows a receiver to be updated with PTy descriptions that do 
not belong to the international table of fixed codes or to other codes that have been 
internationally agreed. PTy downloading applies to coarse and fine codes and is also used 
to establish a link between a coarse code and a maximum of two fine codes. 

PTy downloading is encoded in FIG type 1 extension 2 (FIG 1/2) that relates the (coarse 
or fine) PTy code to a 16-character label describing the programme type. The language 
field indicates the language of the PTy label. 

The example below illustrates the definition of fine code “57” (Piano music), in 
German, by means of FIG 1/2. The fine code is linked to coarse code “Classics” and 
labelled “KLAVIERMUSIK” in the Character field. The language is “German” and the 
code is valid in Germany (Countryld = “D”, ECC = “EO”). 



Coarse 

code 


Fine 

code 


Language 


Character field 


Chr-flag 

field 


E 

C 

C 


Country 

Id 


omo 

(14) 


00111001 

(57) 


00001000 

(German) 


KLAVIERMUSIK^ _ 


0111111110 

000000 

KLAVIER 


E0 


D 



The PTy downloading feature can also be used to transmit definitions of the fixed 
international codes in different languages. Internationally and most nationally defined codes 
are seldom redefined. However, a number of national codes may be set aside for special 
events and may occasionally be redefined. Once a downloaded code has expired this code 
should not be reused for a sufficiently long period, in order not to confuse receivers in 
which this code may have been pre-set. 

2. 5.4.4 Programme Type Preview 

The PTy preview feature allows ensemble and service providers, which implement dynamic 
PTy codes, to provide a preview of (coarse or fine) PTy codes for programmes which are 
planned to be broadcast in the next one or two hours, in the tuned ensemble as well as in 
another ensemble. The PTy preview feature is attractive for listeners operating the PTy 
feature in WATCH mode as they can now be assured that the PTy code they have selected 
will indeed become available. 

PTy preview is encoded in FIG type 0 extension 12 (FIG 0/12). A flag field containing 
coarse codes (in the range. 0 to 31 and/or 32 to 63) or fine codes (in the range 0 to 127 or 
128 to 255) is linked to the Eld (Ensemble Identifier) and the language. 

The example below illustrates a FIG 0/12 signalling PTy preview in Ensemble FI 23 
where some services will start soon with programmes having PTy coarse codes NEWS, 
SPORT, CLASSICS and POP. 



Eld 


L 

flag 


FF 


Language 


Flag field 


F123 


1 


00 (coarse codes 
0 to 31) 


00001111 

French 


00000000000000000 1 000 1 00000 10010 
(classics, pop, sport, news) 
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The language field in FIG 0/12 should be consistent with the service component language 
of FIG 0/5. The flags in the Flag field which signal the PTy coarse or fine codes should be 
reset to “0” as soon as the programme to which the code is allocated is actually transmitted. 

It is important to notice that national PTy codes can have a different meaning in 
different countries, for example, coarse code “48” could mean AGRICULTURE in 
Germany while in France coarse code “48” could mean WOMEN’S INTEREST. PTy 
preview applies to an ensemble, not to a service, that is, it is unknown in advance which 
service will start (soon) with the programme having the previewed code. For an 
unambiguous interpretation of a national PTy code both the Country Id and the ECC have 
to be known. An international ensemble may contain one or more services having a 
Country Id and ECC different from the rest of the services in the ensemble. However, the 
ECC of another ensemble cannot be signalled from the current ensemble (FIG 0/9 does not 
feature Other Ensemble functionality). Only by temporarily tuning to the other ensemble 
will the ECC of that ensemble be known. The consequence is that in some cases a preview 
of national PTy codes (coarse or fine) can be ambiguous and should therefore not be done. 

2. 5.4. 5 Announcements 

The announcement feature is similar to the traffic announcement feature of RDS but on a 
larger scale, providing more categories such as news flashes or weather bulletins. Sixteen 
announcement categories can be coded of which eleven are currently defined. 



Interrupt mechanism 

Just like TA in RDS (see section 5.4) the announcement feature in DAB allows a listener 
to be directed from the current service/audio source to a programme service, which delivers 
a short audio message. 

It is important to notice the subtle difference between the announcement feature and 
vectored dynamic PTys. “News announcements” for example are basically short flashes, 
dynamic and unscheduled for short vectored interrupts. PTy “News” on the other hand 
denotes a “type of programme” which mostly lasts tens of minutes and which is scheduled 
in advance according to a programme number (see also section 2. 5. 3. 4). 

The DAB announcement interruption mechanism is dependent on the following filter 
criteria: 

• The type of announcement, i.e. is the type of announcement supported by the service 
and has the user selected it? 

For every service which is allowed to be interrupted by announcements, static 
announcement support information is provided in FIG type 0 extension 18 (FIG 0/18), 
which comprises 16 announcement support (Asu) flags. These indicate the 
announcement types (supported within the current ensemble) which can interrupt the 
service. This information may be used by the listener to select those announcement 
types for which he or she wants the audio source to be interrupted and to deselect those 
for which he or she doesn’t want an interruption. The announcement switching 
information in FIG 0/19 comprises an announcement switching (Asw) flags field 
indicating the type of ongoing announcement; the SubChld to identify the sub-channel 
that generates the announcement, the cluster Id to identify the cluster to which the 
announcement is directed and optionally the region for which the announcement is 
applicable. The Asu flags are independent of the cluster Id. A special implementation 





60 



Digital Audio Broadcasting: Principles and Applications 



of the announcement feature employs a dedicated announcement channel occupying its 
own sub-channel. 

• The cluster Id, i.e. does the service belong to the cluster to which the announcement is 
directed? 

A cluster represents a group of services to which an announcement is directed with 
the intention of interrupting all the services participating in the cluster. A cluster Id can 
therefore be considered as “a permission-to-interrupt” indicator. The announcement 
support information of FIG 0/18 includes a list of cluster Ids, which identify those 
announcement clusters a service is participating in. If an announcement is sent to a 
cluster that is not in the list, it will not interrupt the service. Cluster Id 0000 0000 
specifies that an ongoing announcement is intended to interrupt the (audio) service 
components of all services having the signalled sub-channel identifier in their service 
organisation information. Cluster Id 1111 1111 is used for alarm announcements. The 
alarm announcement deviates from the other announcement types in that cluster Id 
11111111 causes the receiver to interrupt all services in the ensemble. 

• The Region Id (optional), i.e. does the service belong to the region to which the 
announcement is targeted and has the user selected this region? 

When an announcement is targeted to a particular geographical region, the Region 
Id is appended to the announcement switching information of FIG 0/19. When the 
region Id is absent the announcement is targeted to the whole ensemble service area. In 
the receiver a region filter is considered to be a valuable tool that can allow listeners to 
select information about any region in the ensemble service area and not just in the 
area where the receiver is situated. 

• The New flag, i.e. is the announcement that is being broadcast a new message or not? 

The New flag, signalled in the announcement switching information, is used by 
the service provider to distinguish between a message that is being broadcast for the 
first time (“new”) and a message that is repeated (“old”). The New flag is suited for a 
cyclic announcement channel, that is a sub-channel reserved for repeated emission of 
announcements and possibly not belonging to a specific service. The detection of the 
flag by the receiver allows the listener (if desired) to avoid being interrupted by 
repeated messages announcements. 

The cluster concept 

Suppose a service provider is operating five services grouped in following three clusters: 



Cluster Id 0000 0111 (7) 


Cluster Id 0000 1000 (8) 


Cluster Id 0000 1001 (9) 


F301 (Radio 1) 


F301 (Radio 1) 


F302 (Radio 2) 


F302 (Radio 2) 


F303 (Radio 3) 


F304 (Radio 4) 


F303 (Radio 3) 







Table 2.12 shows an example of possible announcement support information for these 
services by means of several FIGs 0/18. 
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Table 2.12 Example of announcement support information 



Sid 


Asu Flags 


No. of 
Clusters 


Cluster Id 1 


Cluster Id 2 


F301 


0000010000110010 


00010 


00000111 (7) 


00001000 (8) 


Radio 1 


Traffic, News, Weather, Finance 


(2) 






F302 


0000000001000000 


00010 


00000111 (7) 


00001001 (9) 


Radio 2 


Event 


(2) 






F303 


000001000001000 


00010 


00000111 (7) 


00001000 (8) 


Radio 3 


Finance 


(2) 






F304 


0000001000000010 


00001 


00001001 (9) 




Radio 4 


Sport, Traffic 


(1) 







Assume that at a certain moment Radio 1 generates an area weather flash. Radio 1 is 
participating in two clusters: 7 and 8. If the service provider does not want to disturb the 
listeners of Radio 2 then it will direct, in the announcement switching information of FIG 
0/19, the weather flash to cluster “8” so that only Radio 3 listeners may be interrupted. 



Cluster Id 


Asw flags 


New 

flag 


Subchid 


Regionld 
(lower part) 


00001000 

(8) 


0000000000100000 
Weather flash 


1 


011011 (27) 


100010(34) 

Bretagne 



This FIG 0/19 signals that the ongoing announcement is a new “weather” announcement, 
generated in sub-channel “27”, directed to cluster “8” and targeted to region “34” 
(Bretagne). This weather flash will interrupt service F303 if the user has not deselected it 
and if either the receiver is located in Bretagne or the user has selected “Bretagne” from the 
region menu. 

It is clear that the cluster concept is a powerful tool for the service provider to control 
the interruption of services by the various announcements generated by the services in the 
multiplex. In multilingual ensembles for example the cluster concept can be used to 
prevent listeners from being interrupted by announcements in a language they don’t 
understand. 

Other ensemble announcement feature 

Other ensemble (OE) announcement support information is encoded in FIG type 0 
extension 25 (FIG 0/25) and switching information in FIG type 0 extension 26 (FIG 0/26). 
The latter is used to signal the identity of the ensemble providing the announcement, 
together with the Cluster Id Current Ensemble and the Cluster Id Other Ensemble. 

In the same manner as for the basic announcement feature, the receiver can determine 
whether an interruption is permitted. The frequency information (see section 2. 5.5.1) 
together with the Eld Other Ensemble allows the receiver to determine the appropriate 
centre frequency of the other ensemble. The receiver can then retune to that frequency and 
decode the basic announcement switching information (FIG 0/19) in the other ensemble. 
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Consider several other ensembles with overlapping service areas and covering an area 
within the service area of the ensemble to which the receiver is tuned. When an 
announcement is ready to be broadcast in one of the other ensembles, the announcement 
switching information (FIG 0/19) for that ensemble is relayed to the provider of the tuned 
ensemble so that the appropriate OE service announcement switching information (FIG 
0/26) can be generated. 

The region Id specified within the switching information allows the ensemble provider 
to derive and supply one or more appropriate region identifiers from knowledge of the 
service area of the OE service. The region Ids must be chosen from those defined for use 
within the tuned ensemble. When region Ids are not allocated for the current ensemble, 
Region Id Current Ensemble = 000000 must be used. When region Ids are not allocated for 
the other ensemble the region flag must be set to “0” or Region Id Other ensemble 
“000000” must be used. 

Example of OE announcement support (FIG 0/25): 



Sid 


Asu flags 


No. of Elds 


Eld 1 


EId2 


F123 


0000000000010000 
Area weather flash 


0010(2) 


F444 


F555 



This FIG 0/25 signals that in two other ensembles (F444 and F555) “area weather flash” 
announcements are supported which can possibly interrupt service FI 23. It is important to 
notice that only one announcement support flag can be set at a time. For each other 
supported announcement type a separate FIG 0/25 must be transmitted. 

Example of OE announcement switching (FIG 0/26): 



Cluster Id 
CE 


Asw flags 


Region Id 
CE 


Eld OE 


Cluster Id 
OE 


Region Id 
OE 


00000010 

(2) 


0000000001000000 
Sport report 


100010 (34) 
Bretagne 


F555 


00000111 (7) 


001110(14) 

Rennes 



This FIG 0/26 signals that a “sport report” announcement is now ongoing in the other 
ensemble F555 where it is directed to cluster “7”. This cluster corresponds to cluster “2” in 
the current ensemble, so that the announcement can possibly interrupt service FI 23. In 
ensemble F555 the event announcement is targeted to region “14” (Rennes) which maps to 
region “34” (Bretagne) in the current ensemble. 

FM announcements feature 

FM announcement support is encoded in FIG type 0 extension 27 (FIG 0/27). FM 
announcement switching is encoded in FIG type 0 extension 28 (FIG 0/28). 

Consider an RDS service covering an area within the DAB ensemble coverage area. 
When a traffic announcement (TA) is ready to be broadcast on the RDS service, the “TA” 
flag should be relayed to the DAB ensemble provider so that the appropriate FM service 
announcement switching information (FIG 0/28) can be generated. Great care has to be 
taken to synchronise the information that is transmitted at the start of the message. 
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Therefore a raised TA flag must be present in the RDS service as long as the announcement 
is going on. The ensemble provider can supply one or more appropriate region Ids from 
knowledge of the service area of the RDS service. The region Ids must be chosen from 
those defined for use within the tuned ensemble. The selected region allows the receiver to 
filter out only those RDS traffic announcements relevant to that region. However, RDS 
services may not be available over the whole DAB ensemble coverage area. The receiver 
must therefore check as a background task, using the frequency information feature (see 
section 2.5.5. 1), which RDS services are currently available at the receiver’s location. 
Instead of the listener choosing a particular region(s), the relevant “local” region may be 
established automatically by using an accurate receiver location-finding mechanism based 
on the TII feature (see sections 2. 5. 5. 3 and 7.3.7) and regional identification features (see 
section 2. 5. 5. 4). 

Example of FM announcement support (FIG 0/27): 



Sid 


Nr of PI codes 


Pll 


PI2 


PI3 


PI4 


FI23 


0100 (4) 


FA1 1 


FB22 


FC33 


FD44 



This FIG 0/27 signals that four RDS stations, FA1 1, FB22, FC33 and FD44, are TP stations 
on which traffic announcements can be generated which can possibly interrupt DAB 
service FI 23 in the current ensemble. 

Example of FM announcement switching (FIG 0/28): 



Clusterld CE 


Regionld CE 


PI 


00000001 (1) 


001110(14) 

Rennes 


FA33 



This FIG 0/28 signals that an ongoing traffic announcement on RDS station FA33 (Radio 
Rennes) is relayed by the DAB ensemble provider to the current ensemble and directed to 
cluster “1” so that it can possibly interrupt DAB service FI 23. As the traffic announcement 
from Radio Rennes is rather of local importance, the DAB ensemble provider has decided 
to target it to region “Rennes” rather than region “Bretagne”. 

2. 5.4.6 Other Ensemble Services 

The Other Ensemble Services (OE services) feature indicates which other ensembles are 
carrying a particular service. The OE services feature can be used for both programme 
services and data services. The feature is encoded in FIG type 0 extension 24 (FIG 0/24) 
where a link between the Sid of a service and Elds of other ensembles carrying that service 
is provided. 

When the OE flag in the header of FIG 0/24 header is set to “0”, FIG 0/24 provides a 
link between a service carried in the current ensemble and other ensembles carrying that 
service. When the OE flag is set to “1”, FIG 0/24 provides a link between a service carried 
in another ensemble and other ensembles carrying that service. 

The example below illustrates the case where a programme service FI 23 (16 bits) is an 
unscrambled service which is carried in three other ensembles F444, F555, ¥ 666 . 
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SId 


Rfa 


CAId 


Number of Elds 


Eldl 


EId2 


EId3 


F123 


0 


000 


0011 


F444 


F555 


F666 



Although the OE services feature is optional it becomes essential if the frequency 
information associated with a particular service needs to be signalled. However, a receiver, 
using the OE information alone in order to present the listener with a list of possible 
services from which to make a selection would produce a long list for the whole country, 
the majority of which may not be available to the listener. Therefore the Transmitter 
Identification Information (TII) feature (see section 2. 5. 5. 3) is important because receivers 
using TII codes in order to identify the transmitter(s) they are tuned to can estimate their 
own location. OE information should be kept up to date as far as possible. After a multiplex 
reconfiguration in another ensemble there may be some obsolete information sent for some 
time. This period should be minimised. 

2.5,5 Tuning Aids 



2.5.5.1 Frequency Information 

The Frequency Information (FI) feature is used to signal the radio frequencies for other 
DAB ensembles and other broadcast services, such as FM (with and without RDS) and 
AM. 

When used in combination with the region definition feature (see section 2. 5. 5. 4) a 
geographical area filter may be provided to allow the receiver to determine which of the 
(many) frequencies listed are worth checking in the area where it is situated. 

The FI feature allows mobile receivers leaving the ensemble coverage area to retune to 
an alternative frequency (service following). The alternative frequency may apply to an 
identical DAB ensemble (same Eld), another ensemble carrying the equivalent primary 
service component or an RDS or AM service. The services can be identical (Sid = PI or 
dummy code) or hard-linked (see section 2. 5. 5. 2). If there is no alternative source for the 
same component the FI feature can help receivers to find a primary service component, 
which is soft-linked to the currently selected one. 

The FI feature also helps receivers to get faster access to other DAB ensembles that are 
available in certain regions. An ensemble scan may be speeded up if the receiver knows the 
centre frequencies of all other DAB ensembles. 

In combination with the OE/FM announcement feature (see section 2. 5. 4. 5) the FI 
feature can establish a link between services and frequencies needed for announcement 
switching. In conjunction with other ensembles or the FM PTy WATCH function FI allows 
fast access to services starting to broadcast a programme item of the desired type. 

FI Lists per Region Id 

The FI feature is encoded in FIG type 0 extension 21 (FIG 0/21) where, per region Id, one 
or more FI lists are given. An FI list can contain up to two DAB ensemble frequencies and 
up to eight RDS frequencies so that a maximum of four DAB ensemble frequencies or 17 
RDS frequencies can be signalled per region in one FIG 0/21. 

Thanks to the Regionld the receiver can check only a restricted and yet directly relevant 
list of alternative frequencies but it requires the receiver to know where it is located. The 
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Regionld is the full 1 1-bit version so that the region can but need not be a labelled region 
that is known to the user. 

Regionld = “0” means that no area is specified. In this case the FI is valid for the whole 
coverage area of the signalled ensemble. The R&M field allows differentiating between 
DAB ensembles and FM services (with or without RDS) and AM services (MW in steps of 
9 kHz or 5 kHz). 

Continuity Flag 

The continuity flag is an indication whether “continuous output” can be expected or not, for 
example, when switching to another DAB ensemble or when switching to equivalent and 
hard-linked services. The continuity flag may be set to “1” if the conditions listed in Table 
2.13 are fulfilled. 

The frequency of an ensemble is denoted as geographically adjacent when this ensemble 
is receivable somewhere in the region specified in the region Id field. This is important in 
the case of a multiple frequency network (MFN) comprising several SFNs at different 
frequencies. Not-geographically-adjacent-area means that it is not guaranteed that the 
ensemble for which the centre frequency is signalled is receivable somewhere in the region 
specified in the region Id field. 



Table 2.13 Conditions for putting continuity flag = “1” 



In case of frequencies 
of DAB ensembles 
(covering adjacent 
areas) 



In the control field the transmission mode must be signalled. 

When the frequency of an ensemble is signalled, this ensemble must be 
synchronised in time and frequency with the current (tuned) ensemble. 
This means that the null symbols of corresponding transmission frames 
must coincide (tolerance = guard interval duration). The centre 
frequency of the signalled ensemble must not deviate by more than 10 
Hz from the nominal CEPT frequency. 

Services present in the tuned and referenced ensemble should have 
identical CU start address, sub-channel size and error protection profile 
for their primary service component. For the primary service 
component the CA conditions need to be unique and the CIF counter 
values must be equal. 



These continuity requirements apply to all frequencies in the “Freq. 
list” field. 

In case of frequencies The time delay between the audio signal of the DAB service and the 
of FM /AM services analogue service must not exceed 100 ms. 



Importance of the OE flag 

Particular attention should be given to the OE flag in the sub-header of FIG 0/2 1 . 

• The OE flag is set to “0” if the FI applies to the whole tuned ensemble or to an FM or 
AM service carrying a primary service component from the tuned ensemble (PI or 
dummy code = Sid). 

• The OE flag is set to “1” if the FI applies to frequencies of ensembles other than the 
tuned ensemble or to FM/AM services which are not identical to a primary service 
component from the tuned ensemble. 
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For the latter two cases the continuity flag is set to “0”. 

Table 2.14 below shows the settings of the OE flag and Cont. flag in relation to the Id 
field. 



Table 2.14 FIG 0/21: OE and continuity flag settings in relation to the Id field 



Id Field 


R&M 


OE Flag 


Cont. Flag 


Control Field 


Eld of current ensemble 


0000 


0 


0 or 1 


used 


Eld of another ensemble 


0000 


1 


Oor 1 


used 


RDS station with PI = Sid 


1000 


0 


0 or 1 


absent 


RDS station with PI * Sid 


1000 


1 


0 


absent 


FM station with dummy Id = SId 


1001 


0 


0 or 1 


absent 


FM station with dummy Id * SId 


1001 


1 


0 


absent 


AM station with dummy Id = SId 


1010 


0 


Oor 1 


absent 


AM station with dummy Id * SId 


1010 


1 


0 


absent 



Examples of FIG 0/21 

Example 1: The following FIG 0/21 illustrates the signalling of the centre frequency of a 
DAB ensemble F888 in the region of Marseille. The OE flag is set to “1”. The ensemble 
operates in transmission mode II and is receivable somewhere in the Marseille region so 
that the control field is coded as 00100, meaning “geographically adjacent, transmission 
mode II”. 



OE flag 


Regionld 


Id field 


R&M 


Cont. 

flag 


Length of 
freq. list 


Control 

field 


Freql 


1 


0x028 

Marseille 


0xF888 

(Eld) 


0000 

(DAB) 


0 


Oil 

(3 bytes) 


00100 
geog. adj. 
mode II 


0xlC53C 

(1463.232 

MHz) 



Example 2: The following FIG 0/21 illustrates the signalling of three RDS frequencies of 
France Inter for the region of Marseille. 



Regionld 


Id field 


R&M 


Cont. 

flag 


Length 

of 

freq. list 


Freql 


Freq2 


Freq3 


0x028 

Marseille 


0xF201 

(PI) 

Fr. Inter 


1000 

(RDS) 


0 


011 

(3 bytes) 


0x26 

91.3 MHz 


0x2A 
91.7 MHz 


0x4F 
95.4 MHz 



2. 5.5.2 Service Linking 

When different services carry the same audio programme, service linking information is 
used to signal when the services may be treated as identical; in this case the services are 
said to be “hard-linked” together. The second use allows services to be generically related. 
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This means that they do not necessarily carry the same audio but the kind of programmes 
supported is similar in some way, for example different local services from the same 
service provider. In this case the services are said to be “soft-linked” together. 

When the listener is moving outside the reception area of an ensemble, the service 
linking feature helps the mobile receiver to find alternatives if the originally tuned service 
is no longer available. The listener may in this way “follow” and retain the same 
programme. In the case of dual DAB/RDS receivers the feature allows service following 
between the same service on DAB and on RDS. In the case of simulcast programmes in 
different ensembles or simulcasting on DAB and RDS common identifiers may be used 
instead of service linking because in this case frequency information provides all the 
information needed for service following. 

Short and Long form of FIG 0/6 

Service linking is encoded in FIG type 0 extension 6 (FIG 0/6) which has two versions: a 
short form and a long form. 

The short form is used for rapid delinking or to signal a Change Event (CEI) and 
contains the Linkage Set Number (LSN) together with a number of flags such as the LA 
flag (Linkage Actuator), the S/H flag (Soft/Hard) and the ILS (International Linkage Set 
Indicator). 

The long form has an Id list of service identifiers appended (PI codes or dummy codes 
referring to the linked services). 

The LSN is a 12-bit code, unique to each set of services that are actually or potentially 
carrying a common programme. 

The LA flag indicates if a service is actually (LA - 1) or potentially (LA = 0) a member 
of the set of programme services described by the LSN. 

The ILS flag indicates that the link applies to only one country (national) or several 
countries (international). 

The combination of the LSN, ILS and S/H parameters identifies a particular set of 
services and constitutes the “key” for the service linking feature. This key must be unique 
for the area where it can be received and also for any adjacent area. There is no relation 
between sets with a common LSN, but with different settings of the ILS. When a link 
ceases the LSN may be reused again for another set of services. 

The IdLQ (Identifier List Qualifier) in the Id list usage field is used to indicate whether 
the Id in the Id list is a DAB Sid or an RDS PI code or a dummy code. 

The Shd flag is a shorthand indicator used to indicate that the identifiers in the Id list, 
having the second nibble in the range “4” to “F”, each represents a list of up to 12 services 
each sharing the same country Id and the same 8 LSBs of the service reference. 

Reference Identifier 

If one of the services of a linkage set belongs to the current ensemble the OE flag in the 
FIG 0/6 header is set to “0”. The identifier of this service is a “reference identifier” and 
must be included at the beginning of the Id list regardless of the IdLQ. 



Example: Assume a set of 16 DAB services with Sid = F21 1, F212,..., F227 and a set of 16 
RDS services with PI = F301, F302,... , F316. 
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When the current ensemble contains service F21 1 then this is a reference service and is 
put first in the Id list. As there are in total 32 services to be put in the Id list, the list has to 
be split into four parts, carried in four different FIGs 0/6: 



FIG 

index 


OE 


SIV 


IdlQ 


Id list 


a 


0 


0 (start list) 


01 ( RDS PI) 


F2 11 (reference), F301, ...,F311 


b 


0 


1 (cont. list) 


00 (DAB Sid) 


F212, F213, F224 


c 


0 


1 (cont. list) 


00 (DAB Sid) 


F225, F226, F227 


d 


0 


1 (cont. list) 


01 (RDS PI) 


F312, ..., F316 



If none of the services belongs to the current ensemble, the OE flag is set to “1”. In this 
case no reference identifier is defined so that the order of identifiers within the Id list is 
arbitrary. 

Example: Same assumptions as above except that none of the services in the linkage set 
belong to the current ensemble: 



FIG 

index 


OE 


SIV 


IdlQ 


Id list 


a 


1 


0 (start list) 


01 (RDS PI) 


F301, F302, ..., F312 


b 


1 


1 (cont. list) 


00 (DAB Sid) 


F21 1, F212, ..., F223 


c 


1 


1 


01 (RDS PI) 


F313, F314, F315, F316 


d 


1 


1 


00 (DAB Sid) 


F224, F225, F226, F227 



2.5.S.3 Transmitter Identification Information Database 

The Transmitter Identification Information (TII) database feature helps the receiver to 
determine its geographical location. TII codes can be inserted into the synchronisation 
channel of the transmitted DAB signal. The “pattern” and “comb” of the TII signals 
correspond to the Mainld (Main identifier) and Subld (Sub-identifier) of the transmitter. 

The TII database feature provides the cross-reference between these transmitter 
identifiers and the geographic location (in terms of coarse and fine longitude and latitude) 
of the transmitters. This can be useful in cases where a DAB receiver needs to make a 
decision to retune to another frequency and the decision criterion is marginal using the TII 
list alone. The TII database feature provides a much better specified area definition than is 
possible with only the geographical description in the region definition. If signals can be 
received from three or more transmitters, the receiver can use this information to perform a 
triangulation and pinpoint its position (see section 7.3.7). 

The TII database feature is encoded in FIG type 0 extension 22 (FIG 0/22). The M/S 
flag allows differentiation between signalling the reference location of the Mainld (M/S 
flag = 0) and signalling the location of transmitters relative to that reference location (M/S 
flag ~1). The location reference does not necessarily need to coincide with a transmitter 
location, but if it does then the latitude offset and longitude offset are set to “0” for that 
transmitter. 
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For terrestrial frequency planning it may be necessary to delay somewhat the various 
transmitter signals constituting an SFN in order to optimise the combined signal at the 
receiver. The Til database feature therefore comprises a TD (Time Delay) field, which 
signals the constant time delay (in microseconds) of the transmitter signal in the range 0 to 
2047 ps. 

In the receiver the latitude of the received transmitter is calculated by converting the 
two’s complement value given in the combined Latitude coarse and fine fields (20 bits) to 
its decimal equivalent and then multiplying this value by 9072 19 . The longitude of the 
transmitter is calculated in a similar way but the decimal equivalent has to multiplied by 
18072 19 . The resolution of latitude is 0.6” which is about 19 metres, while the resolution of 
longitude is 1.2” which corresponds to about 38 metres at the equator (or 19 metres at 60° 
latitude). 

When M/S flag = 1 the location of the transmitters relative to the reference location is 
given in terms of the latitude and longitude offset of the transmitter from the reference 
associated with the same Mainld. The maximum offset from the reference location that can 
be signalled is 5.625° of latitude and 1 1.25° of longitude. 

The following example illustrates the case where the reference position corresponds to a 
transmitter located at 48°07’29” North and 001°37’33” West and which has in its null 
symbol a TII code with pattern number = “3” and comb number = “1”, which means that 
Main Id = 000001 1 and Sub Id = 00001. The latitude is +48°07’29” with 07’ = (07/60)° = 
0.1 1666° and 29” = (29/3600)° =0.008055° so that +48°07’29” = 48.1247222°. 

In FIG 0/22, for M/S = “0”, the latitude is coded as a 20-bit number comprising a 16-bit 
latitude coarse field and a 4-bit latitude fine field. A two’s complement number has to be 
calculated such that when the decimal equivalent “L” of this number is multiplied by 
90°/2 19 the result is +48.1247222°. L • 90°/2 19 = +48.1247222° so that L = +280346.8262 
= 0100 0100 0111 0001 1010 or 0x4471 A. 



Table 2.15 Example of calculation of the latitude/longitude coarse and fine fields in FIG 0/22 





Latitude Coarse 


Longitude Coarse 


Geographical co-ordinates 


48°07’29” (North) 


001°37’33” (West) 


In decimal degrees 


+48.1247222° 


-1.6258333° 


Formula 


L-(90°/2 l9 ) = +48.124722 


L • (180°/2 I9 )=-1. 625833 


Decimal value of L 


+280346 


-4735 


Binary value of L 


0100 0100 0111 0001 1010 


oooooooi ooiooin mi 


One's complement of L 




mi mo lioi iooooooo 


Two's complement of L 




nn mo noi iooooooi 


Value to be put in coarse field 


0100 0100 0111 0001 


nn mo noi iooo 




(0x4471) 


(0xFED8) 


Value to be put in fine field 


1010 


0001 



Since L is a positive number the two’s complement of L corresponds to the binary 
value. The 16 MSBs of this number are put in the latitude coarse field and the 4 LSBs are 
put in the latitude fine field. The values for the longitude coarse and fine fields can be 
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derived in a similar way. Since the longitude is West of Greenwich, value L is negative. 
Table 2.15 shows how the values to be put in the longitude coarse and fine fields are 
derived from the two’s complement number. 

This results in the following coding of FIG 0/22 for M/S = 0 (Main Identifier): 



M/S 


Mainld 


Latitude coarse 


Longitude coarse 


Latitude 

fine 


Longitude 

fine 


0 


0000011 


0100 0100 0111 0001 


1111 1110 1101 1000 


1000 


0101 




(3) 


(0x4471) 


(0xFED8) 


(8) 


(5) 



Since transmitter 3/1 is the reference position the latitude offset and longitude offset are 
equal to zero so that FIG 0/22 for M/S = 1 for transmitter 3/1 would look like: 



M/S 


Mainld 


Subld 


Time delay 


Latitude offset 


Longitude offset 


1 


0000011 

(3) 


00001 

(1) 


00000000000 
(0 ns) 


0000 0000 0000 0000 
(0x0000) 


0000 0000 0000 0000 
(0x0000) 



Suppose that around this reference position in the SFN there are three transmitters (3/2, 3/3, 
3/4) with Mainld - 0000011 (3) and with Subld = 00010 (2), 00011 (3) and 00100 (4). 
Suppose that the latitude offset for transmitter 3/2 is 00°13’14” North and the longitude 
offset is -000° 14’ 15” West (see Table 2.16). The time delay for this transmitter is 14 ps. 

Table 2.16 Example of calculation of the latitude/longitude offset fields in FIG 0/22 





Latitude Offset 


Longitude Offset 


Geographical co-ordinates 


00° 13’ 14” North 


000°14’15” West 


Decimal degrees 


+0.220555° 


-0.2375° 


Formula 


L-(90°/2 19 ) = +0.220555° 


L-(180°/2 19 ) = -0.2375° 


Decimal value of L 


+1285 


-692 


Binary value of [L] 


0000 0101 0000 0101 


0000 0010 1011 0100 


One’s complement of L 




mi noi oioo ion 


Two’s complement of L 




nil noi oioo noo 


Hexadecimal value of L 


0x0505 


0xFD4C 



FIG 0/22 for M/S = 1 (Sub Identifier) for transmitter 3/2 could look like: 



M/S 


Mainld 
( 7 bits) 


Subld 
(5 bits) 


Time delay 
(11 bits) 


Latitude offset 
(16 bits) 


Longitude offset 
(16 bits) 


1 


0000011 

(3dec) 


00010 

(2dec) 


00000001110 
(14 ns) 


0000 0101 0000 0101 
(0x0501) 


mi noi oioo noo 

(0xFD4C) 
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The co-ordinates of transmitter “3/2” are +48°7’29*’ + 00°13’14” = +48°20’43” latitude 
North and -001 o 37’33” - 000°14’15” = -001°51’48” longitude West. 

2.S.5.4 Region Definition and Region Label 

Geographical regions within an ensemble coverage area can be identified by means of the 
region definition feature of FIG type 0 extension 1 1 (FIG 0/11) and the region label feature 
of FIG type 1 extension 3 (FIG 1/3). The region definition feature, which is based on lists 
of transmitter identification codes, allows a receiver to determine roughly where it is 
located. This knowledge allows a receiver to filter out information which is relevant only to 
that region. For example, a much shorter list of alternative frequencies can be checked for 
service following and alarm announcements can be confined to the region concerned. 

The region label feature of FIG 1/3 provides a textual description of a region (e.g. 
Bretagne, Oberbayem, Kent, etc.). Region labels help a listener to select a region in 
conjunction, for example, with the announcement feature. Traffic information concerning 
the destination and the route can be requested as well as that for the starting point of a 
journey. The user can select, say, “Kent” from a region menu if he or she is interested in 
getting information from this region or for filtering (traffic or weather) announcements. 

A region is uniquely identified by means of an 11 -bit Regionld, which is composed of 
two parts: a 5-bit upper part and a 6-bit lower part. This allows up to 2047 regions to be 
identified within an ensemble coverage area. The region label uses only the 6-bit lower part 
(the 5-bit upper part is set to all zeros) so that up to 64 regions per ensemble can be 
labelled. 

Description of a region 

With FIG 0/1 1 there are two ways to describe a region: 

1. By means of a list of transmitter identifiers (TII list). In this case a Regionld is 
associated to a group of transmitters with one common Mainld and different Sublds. 
By means of this list the receiver can determine in which region it is located. This is 
useful for receiver processing in the case of frequency information to select a suitable 
alternative frequency. 

Example: 



Region Id 


Main Id 


Subldl 


Subld2 


Subld3 


Subld4 


Subld5 


00000 10110 
(0x002E) 

Greater 

London 


0001011 

(OxOB) 


00001 

Crystal 

Palace 


00010 

Alexandra 

Palace 


00011 

Guildfor 

d 


00100 

Bluebell 

Hill 


00101 

Reigate 



2. By means of a spherical rectangle. In this case a Regionld is associated to a 
geographical area in terms of a spherical rectangle from which the co-ordinates of the 
South-west comer are given as latitude/longitude coarse coded as a 16-bit two's 
complement number. The size of the spherical rectangle is defined by the extent of 
latitude/longitude coded as a 12-bit unsigned binary number. Region identification 
based on the spherical rectangle allows navigation systems based on GPS to be used 
for automatic receiver location. 
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Example: 



Regionld 


Latitude coarse 


Longitude coarse 


Extent of 
latitude 


Extent of 
longitude 


00000000001 

(0x001) 


1101001000111010 

(0xD23A) 

-32° 1 ri2”(South) 


1111100111010010 

(0xF9D2) 

-008°4 1 ’26’ ’(West) 


000111111111 
(Ox IFF) 
(01°24’12”) 

± 156 km 


000010100000 
(OxOAO) 
(000°52 , 44”) 
± 100 km at 
equator 



Importance of the G/E flag 

The G/E flag in FIG 0/1 1 determines the validity area of the Regionld, that is whether the 
Regionld applies to the ensemble (E) coverage area only (e.g. the Bretagne ensemble), or to 
a “global” (G) coverage area, that is the area defined by the country Id and ensemble ECC 
(e.g. the whole of France). 

Each ensemble provider in a country may use none, some or all of the globally defined 
Regionlds that it finds appropriate for its purposes. All other Regionlds may be defined for 
use ensemble-wide, even if they were globally defined through the above-mentioned 
agreement but not used as global ones in this ensemble. 

Suppose that in France there is an agreement to use code 00000001110 (OxOOOE or 
14dec) as a globally defined Regionld to denote “Bretagne”. An ensemble provider in Paris 
may use this code in its Paris ensemble to denote “Versailles” on condition that the 
provider puts in FIG 0/1 1 the G/E flag - 0 (E) for Regionld 00000001 1 10 and refrains from 
using it as the global code for “Bretagne”. 

For announcements in other ensembles or in FM the selected region allows the receiver 
to filter out only those announcements relevant to that region. However, RDS or OE 
services may not be available over the whole DAB coverage area. The receiver must 
therefore check as a background task, using the FI feature, which RDS or OE services are 
currently available at the receiver’s location in order to prevent the receiver from retuning 
to an RDS or OE service and finding that it cannot be received. Regions must always be in 
terms of those defined for the tuned ensemble. 
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Audio Services and 
Applications 



THOMAS BOLTZE, WOLFGANG HOEG and GERHARD STOLL 



3.1 General 

The DAB system is designed to provide radio services and additional data services. This 
chapter focuses on the main audio services as the central application. The audio services 
use MPEG Audio Layer II to provide mono, stereo and multichannel programmes to the 
listener. Although the first broadcast applications and receivers will only supply stereo 
programmes, the migration to multichannel services is nevertheless already included in the 
system. 

Data services will become more and more important and one can currently see many 
new ideas on their use being developed and tested. Most advanced is the Internet, but it is 
easy to envisage that the data capacity of the DAB system can be used for very similar 
services. This chapter focuses on the main service of DAB - the audio broadcasting service 
- and the standard that is being used. We intend to give the reader an understanding of the 
features of the audio system and the possible pitfalls without going into too many details 
that the user (content provider, broadcaster and listener) has no influence on. 

With the introduction of the Compact Disc and its 16-bit PCM format, digital audio 
became popular, although its bit rate of 706 kbit/s per monophonic channel is rather high. 
In audio production resolutions of up to 24-bit PCM are in use. Lower bit-rates are 
mandatory if audio signals are to be transmitted over channels of limited capacity or are to 
be stored in storage media of limited capacity. Earlier proposals to reduce the PCM rates 
have followed those for speech coding. However, differences between audio and speech 
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signals are manifold since audio coding implies higher values of sampling rate, amplitude 
resolution and dynamic range, larger variations in power density spectra, differences in 
human perception, and higher listener expectations of quality. Unlike speech, we also have 
to deal with stereo and multichannel audio signal presentations. New coding techniques for 
high-quality audio signals use the properties of human sound perception by exploiting the 
spectral and temporal masking effects of the ear. The quality of the reproduced sound must 
be as good as that obtained by 16-bit PCM with 44.1 or 48 kHz sampling rate. If for a 
minimum bit rate and reasonable complexity of the codec no perceptible difference 
between the original sound and the reproduction of the decoded audio signal exists, the 
optimum has been achieved. Such a source coding system, recently standardised by 
ISO/IEC as MPEG-1 Audio (IS 11172) and MPEG-2 Audio (IS 13838), allows a bit-rate 
reduction from 768 kbit/s down to about 100 kbit/s per monophonic channel, while 
preserving the subjective quality of the digital studio signal for any critical signal. This high 
gain in coding is possible because the noise is adapted to the masking thresholds and only 
those details of the signal are transmitted which will be perceived by the listener. 

3.2 Audio Coding 

3.2.1 Basic Principles 

Two mechanisms can be used to reduce the bit rate of audio signals. One mechanism is 
determined mainly by removing the redundancy of the audio signal using statistical 
correlation. Additionally, this new generation of coding schemes reduces the irrelevancy of 
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Figure 3.1 Block diagram of the ISO/IEC MPEG-1 Audio (IS 1 1172) Layer II encoder, as used in 
the EU147 DAB system 
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the audio signal by considering psycho-acoustical phenomena, like spectral and temporal 
masking (Zwicker, 1967). Only with both of these techniques, making use of the statistical 
correlation and the masking effects of the human ear, could a significant reduction of the bit 
rate down to 200 kbit/s per stereophonic signal and below be obtained. 

The encoder block diagram of MPEG Audio Layer II, called “MUSICAM” 
(Dehery, 1991) during its time of development and before standardisation in ISO/IEC, is 
shown in Figure 3.1. Consequently, Layer II is identical to the MUSICAM coding scheme, 
whereas Layer I has to be understood as a simplified version of the MUSICAM system. 
The basic structure of the coding technique which is more or less common to both, Layer I 
and Layer II, is characterised by the fact that MPEG Audio is based on perceptual audio 
coding. 

One of the basic functions of the encoder is the mapping of the 20 kHz wide PCM input 
signal from the time domain into sub-sampled spectral components. For both layers a 
polyphase filterbank which consists of 32 equally spaced sub-bands is used to provide this 
functionality. 

The output of a Fourier transform which is applied to the broadband PCM audio signal 
in parallel to the filter process, is used to calculate an estimate of the actual, time-dependent 
masked threshold. For this purpose, a psycho-acoustic model, based on rules from psycho- 
acoustics, is used as an additional function block in the encoder. This block simulates 
spectral and, to a certain extent, temporal masking too. The sub-band samples are quantised 
and coded with the intention to keep the noise, which is introduced by quantising, below the 
masked threshold. Layers I and II use a block companding technique with a scale factor 
consisting of 6 bits valid for a dynamic range of about 120 dB and a block length of 12 sub- 
band samples. With this kind of scaling technique, Layer I and Layer II can deal with a 
much higher dynamic range than CD or DAT, that is conventional 16-bit PCM. 

In the case of stereo signals, the coding option “Joint Stereo” can be used as an 
additional feature. It exploits the redundancy and irrelevancy of typical stereophonic 
programme material, and can be used to increase the audio quality at low bit rates and/or 
reduce the bit rate for stereophonic signals (Waal, 1991). The increase of encoder 
complexity is small, and negligible additional decoder complexity is required. It is 
important to mention that Joint Stereo Coding does not enlarge the overall coding delay. 

After encoding of the audio signal an assembly block is used to frame the MPEG Audio 
bit-stream which consists of consecutive audio frames. 

The frame-based nature of MPEG Layer II leads to a delay through the encode/decode 
chain which is of the order of two frames (48 ms) when the processing and the tie for 
transmission are not taken into account. Typical delays through real transmission chains are 
of the order of 80 ms. 

3.2.2 Masking of Audio Components 

In order to reduce the bit rate of an audio signal whilst retaining the bandwidth, dynamic 
structure and harmonic balance, an audio codec has to reduce the quantisation step size of 
the signal. A CD signal is typically quantised with 16 bits per sample, which for a 0 dBFS 
sine wave results in a signal-to-quantisation-noise ratio of 96 dB. Reducing the quantisation 
step size will increase the quantisation noise relative to the audio signal. If the human 
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auditory system were susceptible to this increase under all conditions, we could not reduce 
the bit rate. 

Fortunately, the ear is a very imperfect organ. When we are exposed to a tone at one 
frequency, this excites the inner ear and the neurones in the cochlea in such a way that 
signals close (in frequency) to this signal may be inaudible, if their level is sufficiently low. 
This effect, which is called masking, decreases with the distance of the so-called masker 
and the masked signal. 

A second imperfection is the fact that we do not hear audio signals below a certain 
level. This forms an absolute hearing threshold. 

Both effects vary for human beings and have been measured in the past. They form the 
basis of what is called perceptual coding. The way to exploit these effects in a codec is to 
split the signal into small frequency regions and increase the quantisation noise in each 
frequency region such that it is as large as possible while still being masked by the signals 
in that frequency region. 

The effect of masking of a single sine tone and its combination with the absolute 
hearing threshold is depicted in Figure 3.2. Two sine tones, which are masked, are shown in 
the figure as well. 




Frequency 



Figure 3.2 Masking of individual sine tone and absolute hearing threshold 
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13 dB SNR with added white noise 13 dB SNR perceptually shaped noise 



Figure 3.3 Added quantisation noise, without (left) and with (right) shaping 
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In Figure 3.3, the same amount of noise has been added to a sine tone. In the left part of 
the figure the noise has equal amplitude across all frequencies and is therefore audible. In 
the right half of the figure the noise has the same energy but has been shaped according to 
the masking curve. In this case the noise is inaudible. In the MPEG Audio codec the 
shaping of the noise is achieved by splitting the signal into 32 sub-bands before the 
reduction of the quantisation step size. 

The masking properties have been measured for single sine tones and for narrow-band 
noise signals (Z wicker, 1967). The amount of masking depends on the nature of a signal 
and furthermore on the perceived tonality of a signal. The more tonal a signal is, the 
stronger the masking. It is therefore necessary to measure the tonality of a component to 
decide its masking properties. A ready assumption that is made in perceptual coding is that 
masking also occurs for complex signals, such as music and speech. 



3.2. 3 Psycho-acoustic Model 

The psycho-acoustic model calculates the minimum masked threshold, which is necessary 
to determine the just-noticeable noise level for each band in the filterbank. The minimum 
masked threshold is the sum of all individual masking curves as described in the previous 
section. The difference between the maximum signal level and the minimum masked 
threshold is used in the bit or noise allocation to determine the actual quantiser level in each 
sub-band for each block. Two basic psycho-acoustic models are given in the informative 
part of the MPEG-1 standard (IS 1 1 172). While they can both be applied to any layer of the 
MPEG Audio algorithm, in practice most encoders use model 1 for Layers I and II. In both 
psycho-acoustic models, the final output of the model is a signal-to-mask ratio for each sub- 
band of Layer II. A psycho-acoustic model is necessary only in the encoder. This allows 
decoders of significantly less complexity. It is therefore possible to improve the 
performance of the encoder even later, relating the ratio of bit rate and subjective quality. 
For some applications, which do not demand a very low bit rate, it is even possible to use a 
very simple encoder without any psycho-acoustic model. 

The fundamental basis for calculating the masked threshold in the encoder is given by 
results of masked threshold measurements for narrow-band signals considering a tone 
masking noise and vice versa. Concerning the distance in frequency and the difference in 
sound pressure level, very limited and artificial masker/test-tone relations are described in 
the literature. The worst case results regarding the upper and lower slopes of the masking 
curves have been considered, assuming that the same masked thresholds can be used for 
both simple audio and complex audio situations. 

The output of the FFT is used to determine the relevant tonal (i.e. sinusoidal) and non- 
tonal (i.e. noise) maskers of the actual audio signal. It is well known from psycho-acoustic 
research that the tonality of a masking component has an influence on the masked 
threshold. For this reason, it is worthwhile to discriminate between tonal and non-tonal 
components. 

It is also known that the perception of the tonality of a component is time dependent. A 
pure tone is not perceived as tonal initially and builds up over time. This time dependency 
is part of more sophisticated psycho-acoustic models. 

The individual masked thresholds for each masker above the absolute masked threshold 
are calculated depending on frequency position, loudness level and tonality. All the 
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individual masked thresholds, including the absolute threshold, are added to the so-called 
global masked threshold. For each sub-band, the minimum value of this masking curve is 
determined. Finally, the difference between the maximum signal level calculated by both 
the scale factors and the power density spectrum of the FFT and the minimum masked 
threshold is calculated for each sub-band and each block. 

The block length for Layer II is determined by 36 sub-band samples, corresponding to 
1 1 52 input audio PCM samples. This difference of maximum signal level and minimum 
masked threshold is called the signal-to-mask ratio (SMR) and is the relevant input function 
for the bit allocation. 



3. 2. 4 The Filterbank 

A high-frequency resolution, that is small sub-bands in the lower frequency region, whereas 
a lower resolution in the higher frequency region with wide sub-bands should be the basis 
for an adequate calculation of the masked thresholds in the frequency domain. This would 
lead to a tree structure of the filterbank. The polyphase filter network used for the sub-band 
filtering has a parallel structure, which does not provide sub-bands of different widths. 
Nevertheless, one major advantage of the filterbank is given by adapting the audio blocks 
optimally to the requirements of the temporal masking effects and inaudible pre-echoes. A 
second major advantage is the small delay and complexity. To compensate for the lack of 
accuracy of the spectrum analysis of the filterbank, a 1 024-point FFT for Layer II is used in 
parallel with the process of filtering the audio signal into 32 sub-bands. 

The prototype QMF filter is of order 511, optimised in terms of spectral resolution and 
rejection of sidelobes, which is better than 96 dB. This rejection is necessary for a sufficient 
cancellation of aliasing distortions. This filterbank provides a reasonable trade-off between 
temporal behaviour on one side and spectral accuracy on the other side. A time/frequency 
mapping providing a high number of sub-bands facilitates the bit rate reduction, due to the 
fact that the human ear perceives the audio information in the spectral domain with a 
resolution corresponding to the critical bands of the ear, or even lower. These critical bands 
have a width of about 100 Hz in the low-frequency region, that is below 500 Hz, and a 
width of about 20% of the centre frequency at higher frequencies. 

The requirement of having a good spectral resolution is unfortunately contradictory to 
the necessity of keeping the transient impulse response, the so-called pre- and post-echo, 
within certain limits in terms of temporal position and amplitude compared to the attack of 
a percussive sound. Knowledge of the temporal masking behaviour (Fasti, 1977) gives an 
indication of the necessary temporal position and amplitude of the pre-echo generated by a 
time/frequency mapping in such a way that this pre-echo, which normally is much more 
critical compared to the post-echo, is masked by the original attack. In association with the 
dual synthesis filterbank located in the decoder, this filter technique provides a global 
transfer function optimised in terms of perfect impulse response perception. 

3.2.5 Determination of Scale Factors 

The sub-band samples are represented by a combination of scale factor and actual sample 
value. The scale factor is a coarse representation of the amplitude of either one individual 
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sample or a group of samples. This approach allows for a higher reduction of the bit rate 
than to code each individual sample. 

The calculation of the scale factor for each sub-band is performed for a block of 12 sub- 
band samples. The maximum of the absolute value of these 12 samples is determined and 
quantised with a word length of 6 bits, covering an overall dynamic range of 120 dB per 
sub-band with a resolution of 2 dB per scale factor class. In Layer I, a scale factor is 
transmitted for each block and each sub-band, which has no zero-bit allocation. 

Layer II uses additional coding to reduce the transmission rate for the scale factors. 
Owing to the fact that in Layer II a frame corresponds to 36 sub-band samples, that is three 
times the length of a Layer I frame, three scale factors have to be transmitted in principle. 
To reduce the bit rate for the scale factors, a coding strategy which exploits the temporal 
masking effects of the ear has been studied. Three successive scale factors of each sub-band 
of one frame are considered together and classified into certain scale factor patterns. 
Depending on the pattern, one, two or three scale factors are transmitted together with 
additional scale factor select information consisting of 2 bits per sub-band. If there are only 
small deviations from one scale factor to the next, only the bigger one has to be transmitted. 
This occurs relatively often for stationary tonal sounds. If attacks of percussive sounds have 
to be coded, two or all three scale factors have to be transmitted, depending on the rising 
and falling edge of the attack. 

This additional coding technique allows on average a factor of 2 in the reduction of the 
bit rate for the scale factors compared with Layer I. 



3. 2. 6 Bit Allocation and Encoding of Bit Allocation Information 

Before the adjustment to a fixed bit rate, the number of bits that are available for coding the 
samples must be determined. This number depends on the number of bits required for scale 
factors, scale factor select information, bit allocation information, and ancillary data. 

The bit allocation procedure is determined by minimising the total noise-to-mask ratio 
over every sub-band and the whole frame. This procedure is an iterative process where, in 
each iteration step, the number of quantising levels of the sub-band that has the greatest 
benefit is increased with the constraint that the number of bits used does not exceed the 
number of bits available for that frame. Layer II uses only 4 bits for the coding of the bit 
allocation information for the lowest, and only 2 bits for the highest, sub-bands per audio 
frame. 

3.2. 7 Quantisation and Encoding of Sub-band Samples 

First, each of the 12 sub-band samples of one block is normalised by dividing its value by 
the scale factor. The result is quantised according to the number of bits spent by the bit 
allocation block. Only odd numbers of quantisation levels are possible, allowing an exact 
representation of a digital zero. Layer I uses 14 different quantisation classes, containing 
2n - 1 steps, with 2 <n <15 different quantisation levels. This is the same for all sub- 
bands. Additionally, no quantisation at all can be used, if no bits are allocated to a sub- 
band. 
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In Layer II, the number of different quantisation levels depends on the sub-band 
number, but the range of the quantisation levels always covers a range of 3 to 65535 with 
the additional possibility of no quantisation at all. Samples of sub-bands in the low- 
frequency region can be quantised with 15, in the mid frequency range with seven, and in 
the high-frequency range only with three different quantisation levels. The classes may 
contain 3, 5, 7, 9, 15, 63, ...,65535 quantisation levels. Since 3, 5 and 9 quantisation levels 
do not allow an efficient use of a codeword, consisting only of 2, 3 or 4 bits, three 
successive sub-band samples are grouped together into a “granule”. Then the granule is 
coded with one codeword. The coding gain by using the grouping is up to 37.5%. Since 
many sub-bands, especially in the high-frequency region, are typically quantised with only 
3, 5, 7 and 9 quantisation levels, the reduction factor of the length of the codewords is 
considerable. 



3.2.8 Layer II Bit-stream Structure 

The bit-stream of Layer II was constructed in such a way that a decoder of both low 
complexity and low decoding delay can be used, and that the encoded audio signal contains 
many entry points with short and constant time intervals. The encoded digital representation 
of an efficient coding algorithm specially suited for storage application must allow multiple 
entry points in the encoded data stream to record, play and edit short audio sequences and 
to define the editing positions precisely. For broadcasting this is important to allow fast 
switching between different audio programmes. 

To enable a simple implementation of the decoder, the frame between those entry points 
must contain all the information that is necessary for decoding the bit-stream. Owing to the 
different applications, such a frame has to carry in addition all the information necessary for 
allowing a large coding range with many different parameters. In broadcasting frequent 
entry points in the bit-stream are needed to allow for an easy block concealment of 
consecutive erroneous samples impaired by burst errors. 

The format of the encoded audio bit-stream for Layer II is shown in Figure 3.4. Short, 
autonomous audio frames corresponding to 1152 PCM samples characterise the structure of 
the bit-stream. Each audio frame starts with a header, followed by the bit allocation 
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Figure 3.4 Audio frame structure of MPEG- 1 Layer II encoded bit stream 
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information, scale factor and the quantised and coded sub-band samples. At the end of each 
audio frame is the so-called ancillary data field of variable length that can be specified for 
certain applications. Each frame can be accessed and decoded on its own. With 48 kHz 
sampling frequency, the frame duration is 24 ms for Layer II. 



3. 2 . 9 Layer II Audio Decoding 

The block diagram of the decoder is shown in Figure 3.5. First of all, the header 
information, CRC check, the side information (i.e. the bit allocation information with scale 
factors), and 12 successive samples of each sub-band signal are extracted from the 
ISO/MPEG/ AUDIO Layer II bit stream. 



Left Channel 




Programme- 

associated 



Data (PAD) 



Figure 3.5 Block diagram of the MPEG-1 Audio Layer II decoder 



The reconstruction process to obtain PCM audio again is characterised by filling up the 
data format of the sub-band samples regarding the scale factor and bit allocation for each 
sub-band and frame. The synthesis filterbank reconstructs the complete broadband audio 
signal with a bandwidth of up to 24 kHz. The decoding process needs significantly less 
computational power than the encoding process. The relation for Layer II is about 1/3. 
Because of the low computational power needed and the straightforward structure of the 
algorithm, Layer II can be easily implemented in special VLSI chips. Since 1993, stereo 
and multichannel decoder chips have consequently been available from many 
manufacturers. 



3.3 Characteristics and Features of MPEG-1 Layer II for DAB 

The EU147 DAB source coding system permits a digital audio broadcast receiver to use a 
standard MPEG-1 and MPEG-2 Layer II decoder. Besides the emission of the digital audio 
broadcast signal, the MPEG Audio Layer II coding technique and its encoded audio bit- 
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stream can be used in a number of other applications, including contribution between 
broadcast studios, primary and secondary distribution, and news/sports reporting links. 
These different applications require a flexible coding scheme offering a wide range of 
parameters, in particular concerning the bit rates, audio modes (i.e. mono, stereo and 
multichannel representation), protection level of the coded bit-stream and the possibility to 
carry Programme-associated Data (PAD), which enable completely new applications. 

More detailed information can be found in (Stoll, 1995) or (AES, 1996). 



3.3.1 Audio Modes 

The audio coding system, used in Eureka 147 DAB, supports the following modes: 

• Mono (one-channel) mode. 

• Stereo (two-channel) mode. 

• Dual-channel mode. In this mode, the two audio channels can be either bilingual, or two 
mono channels, but with only one header. At the decoder a choice is made on which of 
the two programmes should be decoded. 

• Joint Stereo mode. In the Joint Stereo mode, the encoder exploits the redundancy and 
irrelevancy of stereo signals for further data reduction. The method used for Joint Stereo 
in Layer II is “intensity stereo coding”. This technique still preserves the spectral 
envelope of the left and right channel of a stereo signal, but transmits only the sum 
signal of the sub-band samples of the left and right channel in the high audio frequency 
region (Chambers, 1992). 

• Low sampling frequency coding with^ = 24 kHz. 

• Provisions are made in the Eureka 147 DAB standard (EN 300401) for the inclusion of 
MPEG-2 Audio Layer II 5.1 backwards-compatible Surround Sound coding. 



3. 3.2 Sampling Rate and Input Resolution 

The audio coding algorithm of DAB allows two sampling rates: 48 kHz and 24 kHz. The 
higher sampling rate can be chosen to have a lull audio bandwidth of 20 kHz for the 
transmitted signal and to allow for a direct broadcasting of studio signals without the need 
for sampling-rate conversion. The audio quality of a PCM signal improves with increasing 
resolution of the input signal. Thus, the MPEG Audio Layer II standard can handle a 
resolution of the input signal up to 22 bits/sample. 

The lower bit rate can be chosen to deliver a high quality, in particular for speech 
signals at very low bit rates, that is at bit rates <64 kbit/s per channel. However, this does 
not mean that a new sampling frequency will be introduced outside of the DAB system. 
Instead a downsampling filter from 48 kHz to 24 kHz in the audio encoder and an 
upsampling filter from 24 kHz to 48 kHz in the decoder are used. 



Note: In contrast to other broadcasting systems (e. g. ADR) a sampling frequency of 44, 1 kHz is not applicable, 
which means a source taken from a CD has to be sample-rate converted. 
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3.3.3 DAB Audio Frame Structure 

The DAB audio frame is based on the MPEG Layer II audio frame which includes all the 
main and side information which is necessary for the decoder to produce an audio signal at 
its output again. Additionally, each DAB audio frame contains a number of bytes that may 
be used to carry the PAD, that is the information which is intimately related to the audio 
programme. This structure is depicted in Figure 3.6. The PAD field contains 2 bytes of 
Fixed PAD (F-PAD), and an optional extension called the extended PAD (X-PAD). 
Functions available for the PAD include Dynamic Range Control (DRC), music/speech 
indication, programme-related text and additional error protection. The PAD features are 
explained in section 3.4. 



SCFSt STUFF 




Figure 3.6 Structure of Fixed and Extended Programme-associated Data (F-PAD and X-PAD) and 
Scale Factor CRC (SCF-CRC) applied to the MSBs of the scale factors for either four or 
two sub-band groups, depending on the bit rate 



3.3.4 Audio Bit Rates 

Depending on the type of programme, the number of programmes within the DAB 
multiplex and the protection level, different bit rates in the range of 32 to 192 kbit/s in 
single channel mode can be selected at the audio encoder. The choice of the bit rate is not 
fully independent of the audio mode. Depending on the bit rate, the audio modes given in 
Table 3.1 can be used. Any combination of these options may be used within the DAB 
multiplex (Chambers, 1992). For the half-sampling-rate coding of 24 kHz, 14 different bit 
rates in the range of 8 to 160 kbit/s can be chosen, irrespective of the selected audio mode. 
For the multichannel extension, Table 3.1 is valid for the base bit-stream only. The details 
of the lower sample rates and the multichannel extension are explained in section 3.5. 
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The wide range of bit rates allows for applications that require a low bit rate and high 
audio quality; for example, if only one coding process has to be considered and cascading 
can be avoided. It also allows for applications where higher data rates up to about 180 kbit/s 
per channel could be desirable if either cascading or post-processing has to be taken into 
account. Experiments carried out by the ITU-R (AES, 1996) have shown that a coding 
process can be repeated nine times with MPEG-1 Layer II without any serious subjective 
degradation, if the bit rate is high enough, that is 180 kbit/s per channel. If the bit rate is 
only 1 20 kbit/s, however, no more than three coding processes should occur. 



Table 3.1 Audio modes and bit rates for 48 kHz (left columns) and 24 kHz (right column) 
sampling rate 



Sampling Rate 


48 kHz 


Mode 


Single Ch. 


Dual Ch. 


Stereo 


Int. Stereo 


Bit Rate [kbit/s] 










32 


X 








48 


X 
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X 








64 
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X 
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BMN 


112 
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X 


X 
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X 
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192 
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224 
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X 




256 
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X 


320 
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X 


t^m\ 


384 




X 


X 


X 



24 kHz 


All 


Bit Rate [kbit/s] 


8 


16 


24 


32 






56 


64 


80 


96 


112 


128 


144 


160 



3.4 Programme-associated Data 

The DAB system not only provides a very high audio quality for the listener, but also 
includes provision for many additional data services (BPN 007), (Riley, 1994). Most of 
these data services are independent of the audio programme, but some of them are closely 
related to the audio signal. The latter form is the so-called Programme-associated Data 
(PAD). A fixed and optionally a flexible number of bytes are provided to carry the PAD in 
the DAB audio bit-stream, see Figure 3.6. Additional capacity is provided elsewhere in the 
DAB multiplex (or “ensemble multiplex”) to carry more independent data or additional 
information, such as text, still or moving pictures, etc., see Chapters 4 and 5. 
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The Fixed Programme-associated Data (F-PAD) are carrying amongst others: 

• Dynamic Range Control (DRC) information, which may be used in the receiver to 
compress the dynamic range of the audio. 

• Speech/music indication to allow for different audio processing of music and speech 
(e.g. independent volume control). 

As these features are directly related to the audio programme signal, they are described in 
detail in the following. For more details on other programme-associated information carried 
within the F-PAD or X-PAD (Extended Programme-associated Data), see Chapters 2, 4 
and 5. 

In reproduction of audio broadcast programmes there will always be the need to apply 
dynamic range compression to some types of programme material because of the listeners’ 
requirements. Also the satisfactory balancing of the loudness of different types of 
programme, particularly music and speech, depends principally upon the listeners’ 
requirements. In conventional broadcasting it has never been possible to satisfy different 
individual habits of listening. 

3. 4 . 1 Functionality of Dynamic Range Control 

The DAB DRC feature enables broadcasters to transmit programmes with a relatively wide 
dynamic range, accompanied by a DRC signal which the listener may use to effect 
unobtrusive compression of the programme dynamics, if required (Hoeg, 1994). 

The dynamic range of an audio programme signal (sometimes termed the programme 
dynamic) is the range between the highest and the lowest useful programme signal level. 
The problems associated with programmes having a wide dynamic range, and with 
achieving a satisfactory loudness balance between different part of the radio programme 
(such as speech or music), are well known from experience with VHF/FM broadcasting 
(Muller, 1970). In many cases the source programme dynamic or the dynamic range 
commonly used by the broadcasters (approx. 40 dB) may be much larger than the usable 
dynamic range (the so-called reproduction dynamic) in noisy environments such as in a 
moving car. The reduction required in the dynamic range of the programme may be 1 0 to 
30 dB (or more). 

Taking into account the listening conditions and the requirements of the listener, these 
problems can only be solved at the receiver. For this to be feasible, additional information 
must be provided by the broadcaster concerning the gain adjustments, which may be 
needed to reduce the dynamic range. Therefore, at the broadcaster's premises a DRC signal 
is generated at the studio side, which describes the audio gain to be applied in the receiver, 
as a succession of values. This DRC signal is transmitted in a coded form (DRC data = 
DRCD) together with the audio signal. 

It is a requirement of the DAB specification (EN 300401) that the audio signal is 
transmitted with its original programme dynamic, without any pre-compression. The DRC 
data are incorporated in the DAB bit-stream as PAD. 

In the receiver, the regenerated DRC signal may be used optionally to control the audio 
gain in order to match the dynamic range of the received audio programme to the 
requirements of the listener, or to improve audibility in difficult conditions. 
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The user can choose any appropriate compression ratio between the anchors: 

• no compression : the audio programme is reproduced with the dynamic range as 
delivered by the broadcaster, without any pre-compression before transmitting; 

• nominal compression : the audio programme is reproduced with an appropriate dynamic 
compression as adjusted by the broadcaster (normally a compression ratio of about 1.3 
is used); 

• maximum compression : the audio programme can be reproduced with an extremely 
strong compression ratio (e.g. > 2.0). This makes sense for poor reproduction 
conditions (e.g. in a car) and for special programme types only, because some 
influences on the sound quality cannot be excluded. 



3.4.2 Functionality of Music/Speech (M/S) Control 

It is well known from any kind of radio programme transmission that there may be very 
different requirements in the loudness balance between music and speech programmes, 
depending on the interests of the listener (Ilmonen, 1971). A music/speech (M/S) control 
signal, which is also transmitted in the DAB bit-stream, now enables the listener to balance 
the loudness of different types of programme according to taste. 

There is the option to signal four states of the so-called M/S flag within the PAD: 

00 programme content = Music 

0 1 programme content - Speech 

1 0 programme content - not signalled 

1 1 reserved for future use (e.g. for a programme containing music and speech with 
artistically well-balanced levels, for instance a drama or musical). 

This M/S flag information can be generated at the studio side automatically (e.g. derived 
from the corresponding channel fader at the mixing console). As an item of 
programme-related data, the information is transported within the PAD, and may be used at 
the receiver to control the output signal level (volume) in a predetermined manner, 
probably in a level range of about 0 to -15 dB, to enhance either the music, or the speech. 

TRANSMITTING END RECEIVING END 

Transmission 




Figure 3.7 Block diagram of a DRC system for DAB 
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3. 4. 3 Transport Mechanism of DRC and M/S Data 

Data travelling in the PAD channel are intimately related to the audio programme. They are 
assembled together with the coded audio in the same DAB frame, so the time relation 
established at the source is maintained throughout the complete DAB transmission chain. 
Audio and PAD must not be subject to different transmission delays. 

Figure 3.7 shows a simplified block diagram of the DRC system for DAB, which is also 
valid in principle for transmission of the M/S control flag. 

The DRC signal is generated by a characteristic which is linear over most of the 
dynamic range, with an appropriate compression ratio. An overall range of gain variation of 
about 1 5 dB is linearly coded in a 6-bit digital word, for each 24 ms frame of the broadcast 
audio signal. 

The intention is to provide long-term compression of the dynamic range whilst 
preserving the impact of dramatic changes in the programme dynamics (e.g. that caused by 
a crescendo or subito fortissimo). Therefore the generating algorithms use specific sets of 
audio programme controlled time characteristics, which are not the subject of any 
specification. 

The DAB audio programme signal itself is transmitted without any dynamic 
compression applied after the production/post-production stage. During the production of 
some types of programmes, compression may be applied for “artistic” reasons. However, 
programme signals compressed at this stage are treated in the same way as uncompressed 
signals by the DRC system. 

Because the DRC data are transmitted discontinuously in time intervals of one DAB 
frame (24 ms) to avoid overshoots (i.e. brief periods in which the reproduced signal is 
excessively loud) in the output from the receiver, it is necessary to have a look-ahead time 
of one complete DAB frame in which to analyse the level of the signal. However, a delay is 
still needed in the signal path so that the incoming audio signal may be examined prior to 
the application of any changes in gain. This delay is for instance one frame length (24 ms), 
but may also be implemented with a significantly longer delay, up to several seconds. 

3. 4. 5 The Receiving End 

A common and simple receiver concept realises the necessary DRC functions. These 
functions do not depend upon the type of algorithm in the DRC signal generator used at the 
transmitting end. The same options for listening to the uncompressed programme, the 
programme with the nominal degree of compression and the programme with a greater 
degree of compression are always available. 

The DRC data are demultiplexed from the PAD area, and a linear interpolation 
regenerates a continuous signal to control the gain of the audio channel, in order to match 
the dynamic range of the received programme to the individual listener’s needs. The 
maximum gain change in the compression mode will be about 15 dB. If the error protection 
applied to the PAD fails, specifically defined limiting characteristics for the rate of gain 
change will protect the compressed audio signal against severe impairments. 

The conventional way for controlling the dynamic range is to use a simple 
multiplication in the digital domain, or a VCA (or DCA) at the analogue end of the receiver 
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to realise all necessary gain adjustments (DRC, manual or automatic volume control and 
other gain-related features such as the balancing of music and speech loudness). 

3, 4. 6 Implementations 

Solutions fulfilling the DAB requirements for DRC data have been developed by the BBC 
and Deutsche Telekom with Fa. Junger Audio as well, see (Hoeg, 1994); there is also an 
existing solution from Swedish Broadcasting. The BBC system normally needs a 
significant look-ahead time, typically from 30 ms up to 3 s, but the Deutsche Telekom 
system typically needs only about 24 ms. In the meantime, there are also several 
configurations of DAB receiver equipment (see Chapter 8) which support the DRC and 
M/S functionality. 

Besides the DRC system with control data being transmitted as described in the DAB 
standard (EN 300401), self-contained compressors for use in receivers which are operating 
without the need for transmitted control data have been proposed several times. One of 
these (Theile, 1993) is based on scale factor weighting in the MPEG-1 Layer II source 
decoder (the so-called MUSICAM-DRC system). 



3.5 Half-sampling-rate and Multichannel Audio Coding 

One of the basic features of the MPEG-2 Audio standard (IS 13818-3) is the backward 
compatibility to MPEG-1 (IS 11172) coded mono, stereo or dual-channel audio 
programmes. This means that an MPEG-1 audio decoder is able to properly decode the 
basic stereo information of a multichannel programme. The basic stereo information is kept 
in the left and right channels, which constitute an appropriate downmix of the audio 
information in all channels. This downmix is produced in the encoder automatically. 

The backward compatibility to two-channel stereo is a strong requirement for many 
service providers which may provide high-quality digital surround sound in the future. 
There is already a wide range of MPEG-1 Layer I and Layer II decoder chips which support 
mono and stereo sound. 

The use of multichannel audio is becoming adopted with the advent of digital terrestrial 
television (DVB-T, see section 1.5). With the backward compatibility of the MPEG 
multichannel audio coding standard, such a two-channel decoder will always deliver a 
correct stereo signal with all audio information from the MPEG-2 multichannel audio bit- 
stream. 

MPEG-1 audio was extended as part of the MPEG-2 activity to lower sampling 
frequencies in order to improve the audio quality for mono and conventional stereo signals 
for bit rates at or below 64 kbit/s per channel, in particular for commentary applications. 
This goal has been achieved by reducing the sampling rate to 16, 22.05 or 24 kHz, 
providing a bandwidth up to 7.5, 10.5 or 11.5 kHz. The only difference compared with 
MPEG-1 is a change in the encoder and decoder tables of bit rates and bit allocation. The 
encoding and decoding principles of the MPEG-1 Audio layers are fully maintained. 
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3.5. 1 Characteristics of the MPEG-2 Multichannel Audio Coding System 

A generic digital multichannel sound system applicable to television and sound 
broadcasting and storage, as well as to other non-broadcasting applications, should meet 
several basic requirements and provide a number of technical/operational features. Owing 
to the fact that during the next few years the normal stereo representation will still play a 
dominant role for most of the consumer applications, two-channel compatibility is one of 
the basic requirements. Other important requirements are interoperability between different 
media, downward compatibility with sound formats consisting of a smaller number of audio 
channels and therefore providing a reduced surround sound performance. In order to allow 
applications as universal as possible, other aspects, like multilingual services, clean 
dialogue and dynamic range compression, are also important. 

MPEG-2 audio allows for a wide range of bit rates from 32 kbit/s up to 1066 kbit/s. 
This wide range could be realised by splitting the MPEG-2 audio frame into two parts: 

1. The primary bit-stream which carries the MPEG-1 compatible stereo information of 
maximal 384 kbit/s; 

2. The extension bit-stream which carries either the whole or a part of the MPEG-2 specific 
information, that is the multichannel and multilingual information, which is not relevant 
to an MPEG-1 audio decoder. 

The primary bit-stream realises a maximum of 448 kbit/s for Layer I and 384 kbit/s for 
Layer II. The extension bit-stream realises the surplus bit rate. If, in the case of Layer II, a 
total of 384 kbit/s is selected, the extension bit-stream can be omitted. The bit rate is not 
required to be fixed, because MPEG-2 allows for variable bit rate which could be of interest 
in ATM transmission or storage applications, such as DVD (Digital Video Disk), 

Serious quality evaluations (BPN 019), (Wiistenhagen, 1998) have shown that a bit rate 
of about 512 to 640 kbit/s will be sufficient to provide a reasonable audio quality for 
multichannel audio transmission using MPEG-2 Layer II audio coding (non-coded 
transmission of five signals would need a net rate of 5 x 768 - 3.840 kbit/s). 

3.5.2 3/2-stereo Presentation Performance 

The five-channel system recommended by ITU-R (BS.775-1), SMPTE (RP200) and EBU 
(R96) is referred to as “3/2-stereo” (3 front/2 surround channels) and requires the handling 
of five channels in the studio, storage media, contribution, distribution, emission links, and 
in the home. 



3.5.3 Low-frequency Enhancement Channel 

According to (BS.775-1), the 3/2-stereo sound format should provide an optional Low- 
frequency Enhancement/Extension (LFE) channel in addition to the full range main 
channels. The LFE is capable of carrying signals in the frequency range 20 Hz to 120 Hz. 
The purpose of this channel is to enable listeners, who choose to, to extend the low- 
frequency content of the audio programme in terms of both low frequencies and their level. 
From the producer’s perspective this may allow for smaller headroom settings in the main 
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audio channels. In practice, the LFE channel is mainly used with film sound tracks. The 
mentioned 3/2 format is commonly called “5.1 format”, where “.1” means the LFE channel. 

3.5.4 Backward/forward Compatibility with MPEG-l 

For several applications it is the intention to improve the existing 2/0-stereo sound system 
step by step by transmitting additional sound channels (centre, surround), without making 
use of simulcast operation. The multichannel sound decoder has to be backward/forward 
compatible with the existing sound format. 




Figure 3.8 Backward compatibility of MPEG-2 Audio with MPEG-1 



Backward compatibility means that the existing two-channel (low-price) decoder 
should properly decode the basic 2/0-stereo information from the multichannel bit-stream 
(see Figure 3.8). This implies the provision of compatibility matrices (Kate, 1992) using 
adequate downmix coefficients to create the compatible stereo signals L 0 and Ro, shown in 
Figure 3.9. The inverse matrix to recover the five separate audio channels in the MPEG-2 
decoder is also shown in the same figure. 

The basic matrix equations used in the encoder to convert the five input signals L, R, C, 
LS and RS into the five transport channels T 0 , Ti, T 2 , T 3 and T 4 are: 
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T 0 = L 0 = (a x L) + p x (a x C) + y x (a x LS) 
Ti = Ro = (a x R) + p X (a X C) + y X (a X RS) 
T 2 = C w = axpxC 
T 3 = LS W = a x y x LS 
T 4 = RS W = a X y X RS 



Matrixing Transmission Dematrixing 




Figure 3.9 Compatibility matrix (encoder) to create the compatible basic stereo signal, and 
the inverse matrix (decoder) to re-establish the discrete five audio channels 



In order to obtain maximal bit-rate reduction, T 2 , T 3 and T 4 are also allowed to carry 
(a x L) and/or (a x R) instead of the listed (p x a x C), (y x a x LS) and (y x a x RS). 

Four matrix procedures with different coefficients a, p and y have been defined and can 
be chosen in the MPEG-2 multichannel encoder. Three of these procedures add the centre 
signal with 3 dB attenuation to the L and R signals. The surround signals LS and RS are 
added to the L, respectively R, signals with either 3 dB or 6 dB attenuation. The possibility 
of an overload of the compatible stereo signals L 0 and Ro is avoided by the attenuation 
factor a which is used on the individual signals L, R, C, LS and RS prior to matrixing. One 
of these procedures provides compatibility with Dolby Surround® decoding. Being a two- 
channel format, compatibility can already be realised in MPEG-1. MPEG-2 allows such 
transmissions to be extended to a full, discrete five-channel size. 

The fourth procedure means that no downmix is included in the bit-stream, which 
actually constitutes a “Non-backwards Compatible” (NBC) mode for the MPEG-2 
multichannel codec. An MPEG-1 decoder will produce the L and R signals of the 
multichannel mix. In certain recording conditions this ’’matrix" will provide the optimal 
stereo mix. 

Forward compatibility means that a future multichannel decoder should be able to 
decode properly the basic 2/0-stereo bit-stream. 
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The compatibility is realised by exploiting the ancillary data field of the ISO/IEC 1 1172-3 
audio frame for the provision of additional channels (see Figure 3.10). The “variable 
length” of the ancillary data field gives the possibility to carry complete multichannel 
extension information. If for Layer II the bit rate for the multichannel audio signal exceeds 
384 kbit/s, an extension part is added to the MPEG-1 compatible part. 




Header 



Ext-Sync 

Ext-CRC 



Ext-MC-Audio Data 

Ext*Anc. Data 




MPEG-2 Audio Frame - MPEG-1 "Compatible Part' 



Figure 3.10 ISO/IEC 13818-3 (MPEG-2 Audio) Layer II multichannel audio frame consisting of 
the MPEG-1 compatible part and the extension part 



However, all the information about the compatible stereo signal has to be kept in the 
MPEG-1 compatible part. In this case, the MPEG-2 audio frame consists of the MPEG-1 
compatible and the (non-compatible) extension parts. This is shown in Figure 3.10. 

In the first generation the DAB system (EN 300401) will not provide multichannel 
sound. Therefore the extension to digital surround sound has to be backward/forward 
compatible with an MPEG-1 Audio decoder. 



3 . 5 . 5 Compatibility with Matrix Surround 

Matrix surround systems, such as Dolby ProLogic®, have found wide acceptance in 
consumers’ homes. Many movies and drama programmes are produced not only in discrete 
multichannel but also in matrix surround for delivery on video tape and analogue broadcast. 

To broadcasters this legacy of matrix surround systems means that they should be able 
to continue to broadcast matrix surround material with the digital system. The audio coding 
system should be compatible with the matrix surround system. 

The BBC conducted a very good study on the compatibility of MPEG Layer II and 
Dolby ProLogic® (then the dominant matrix surround system) and found that, provided 
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some precautions are taken, the surround information is completely preserved 
(Meares, 1998). 



3 . 5 . 6 Multilingual Extension and Associated Services 

MPEG-2 Audio provides alternative sound channel configurations in the multichannel 
sound system, for example the application of the “second stereo programme” might be a 
bilingual 2/0-stereo programme or the transmission of an additional binaural signal. Other 
configurations might consist of one 3/2 surround sound plus accompanying services (e.g. 
clean dialogue for the hard-of-hearing, commentary for the visually impaired, multilingual 
commentary, etc.). For these services, either the multilingual extension or the ancillary data 
field, both provided by the MPEG-2 bit-stream, can be used. 

A good case for providing a multilingual service in combination with surround sound is 
given when the spoken contribution is not part of the acoustic environment that is being 
portrayed. Surround sound sports effects can be combined with multiple language mono 
commentary channels. In contrast, surround sound with drama would require a new five- 
channel mix for each additional language. 

An important issue is certainly the “final mix in the decoder”; that means the 
reproduction of one selected commentary/dialogue (e.g. via centre loudspeaker) together 
with the common music/effect stereo downmix (examples are documentary film, sport 
reportage). If backward compatibility is required, the basic signals have to contain the 
information of the primary commentary/dialogue signal, which has to be subtracted in the 
multichannel decoder when an alternative commentary/dialogue is selected. 

In addition to these services, broadcasters should also be considering services for 
hearing-impaired consumers. In this case a clean dialogue channel (i.e. no sound effects) 
would be most advantageous. This could be transmitted at a low bit rate of about 48 kbit/s 
with the lower sampling frequency coding technique. This provides excellent speech 
quality at a bit rate of 64 kbit/s and even below, making very little demand on the available 
capacity of the transmission channel (BPN 021). 

3.6 Quality of Service 

The quality of service in digital transmission systems is typically much more constant than 
with analogue transmissions, such as FM or AM. The digital transmission system provides 
a very robust channel for the compressed audio data. Variations in the quality of the 
transmission channel, fading effects, shadowing and interference with other sources that 
affect the audio quality directly in an analogue system will have much less influence on the 
perceived quality in a digital system. The reason for this is that the digital modulation, 
together with the error correction, protect the audio data from corruption. 

As long as the error correction can correct the influence of the channel, no change in the 
audio stream will occur, resulting in a constant quality over time. Only when the error 
correction fails will the audio be impaired. If this happens, the severity of the impairment 
depends on which part of the audio frame is corrupted. 

Under the assumption that the error correction works, the quality of service in a digital 
system is mainly determined by the combination of bit rate, codec implementation and 
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actual programme material. Another factor is the listener. Experience shows that what 
might be an inaudible artefact when heard first might become very obvious when the 
listener learns to identify it as an artefact. This is true for any kind of coding scheme and 
also holds in the field of video compression. 



3. 6. 1 Quality Versus Bit Rate 

The perceived audio quality depends on the bit rate in a non-linear fashion. Towards higher 
bit rates saturation around imperceptible differences with the original can be observed, 
while the quality suddenly drops towards lower bit rates. At this point the coding scheme 
simply cannot provide satisfactory results and a number of different artefacts (such as 
quantisation noise, bandwidth limitation and change of the stereo image) occur all at once. 

The MPEG-1 standard (as it is used on DAB) uses a sample rate of 48 kHz. MPEG-2 
has defined an extension to 24 kHz in addition. Both are allowed in DAB. 

The motivation behind the extension to lower sample rates was to provide a better 
frequency resolution for signals which are band limited by nature. These are typically 
speech programmes, which constitute a significant part of broadcasts. 

The limitation to a bandwidth of about 1 1 kHz results in a higher frequency resolution 
at frequencies where the ear is very sensitive to errors. This results in a crossover point 
between 48 kHz and 24 kHz, at which it is more attractive to use either 48 kHz or 24 kHz, 
depending on the actual bit rate and the programme material. 

The data presented in Figure 3.1 1 and Figure 3.12 are taken from (EBU, 1999). It is a 
summary of several different listening tests, performed by several international 
organisations. The listening tests were executed according to the ITU rules and the scale 
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Figure 3.11 MPEG Layer II, stereo 
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Figure 3.12 MPEG Layer II, mono 



used is that of absolute impairment when compared to the original. The three points for 
each bit rate give the mean score for the least critical item and the most critical item on the 
upper and lower end and the mean score over all items in the middle. 

We mentioned earlier that the audio quality for a certain bit rate and setting depends on the 
actual codec implementation, because the psycho-acoustic model and the bit allocation 
strategy are not part of the standard. 

Experience with developing encoders shows that the differences in audio quality can be 
fairly significant. The results given above can only be an indication of the general trend, but 
should not be used as the only rule to set all the coding parameters in the daily practice. A 
further reason for this lies in the nature of listening tests: in every well-organised listening 
test, most of the time is spent on finding the most critical material for all encoders involved, 
whilst maintaining a balance between different types of material and the different encoders. 
This leads to results for the worst case of known material, but not for typical material. 

Further information on the relationship between quality and bit rate can be found in 
(AES, 1996). 

3.7 Error Protection and Concealment 

Only very limited error protection for the audio bit-stream is provided in the MPEG 
standard. Error protection has to take into account the characteristics of the source data and 
the transmission channel. The MPEG Audio standards have been written for a wide range 
of applications with very different transmission channels, ranging from nearly completely 
transparent (i.e. error-free transmission channels, like storage on computer hard disks) to 
very hostile transmissions paths, like mobile reception with DAB (EN 300401). 
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3. 7 . 1 Error Protection 

In the presence of only a very few errors, say with a bit error rate of about 10' 5 to 10' 6 and 
lower, the optional CRC check, provided in the ISO standard, will in general be an efficient 
tool to avoid severe impairments of the reconstructed audio signal. Errors in the most 
sensitive information, that is header information, bit allocation (BAL) and scale factor 
select information (SCFSI), can be detected. The odds of bit errors in this part of the audio 
frame are small. If, however, one single bit error occurs in these fields, the result will be the 
loss of a complete audio frame. In this case, the result of a single bit error is the same as if a 
complete audio frame is lost by a burst error or cell loss. 

To protect a listener of ISO/MPEG Audio coded audio signals from annoying 
distortions due to bit errors, channel coding has on the one hand the task to correct as many 
bit errors as possible, and on the other hand to enable a detection of residual bit errors. In 
the Eureka 147 DAB system, the data representing each of the programme services being 
broadcast is subjected to energy dispersal scrambling, convolutional coding and time 
interleaving. The convolutional encoding process involves adding redundancy to the service 
data using a code with a constraint length of 7. In the case of an audio programme, stronger 
protection is given to some bits than others, following a pre-selected pattern known as the 
Unequal Error Protection (UEP) profile, shown in Figure 3.13. The average code rate, 
defined as the ratio between the number of source-encoded bits and the number of encoded 
bits after convolutional encoding, may take a value from 0.33 (the highest protection level) 
to 0.75 (the lowest protection level). Different average code rates can be applied to different 



Example: 192 kbit/s, Stereo, Protection Level 3, Rav = 0.5 
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Figure 3.13 Unequal error protection scheme used in the DAB system for an ISO/IEC 1 1172-3 
Layer II audio frame 
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audio sources. For example, the protection level of audio services carried by cable networks 
may be lower than that of services transmitted in radio frequency channels. 

Even with the best error correction scheme residual bit errors cannot be completely 
avoided, and have to be detected, especially for information which is very sensitive to bit 
errors. 

This includes the header and control information, which has been taken into account by 
the ISO CRC, but also for scale factors, which has been only partially considered by 
ISO/MPEG. 

The optimisation criterion for protecting encoded audio signals against bit errors is not 
the minimisation of the bit error ratio as the most important issue, but to minimise the 
perception of audio signal distortions in the case of bit errors. The subjective annoyance of 
bit errors depends strongly on the kind of disturbed data for the ISO/MPEG Audio coded 
signal. 

Whereas a bit error rate of 10' 4 for sub-band samples results in barely audible 
degradation, any single bit error in the control information (header, BAL or SCFSI) causes 
a total frame error of 24 to 36 ms. The individual bit error ratios have to be balanced with 
respect to a certain degradation with the goal of minimising the overall distortion. Besides 
the economical aspect due to minimal additional redundancy, a UEP scheme allows for 
“graceful degradation” which may be very convenient for the listener in the case of an 
increase of the bit error rate. 

In the case of the Eureka 147 DAB system, the following measures for residual error 
detection strategies can be used: 

Frame CRC (First Cyclic Redundancy Check), specified by ISO/IEC (IS 1 1 172). A 16-bit 
parity check word can be used for error detection of the main audio information within the 
encoded bit-stream, that is ISO header, BAL and SCFSI. If the CRC is violated a 
concealment based on non-reliable frames is requested. 

Scale factor CRC (Second Cyclic Redundancy Check), specified by Eureka 147/ETSI 
(EN 300401). Up to four 8-bit parity check words can be used for error detection of single 
scale factors within sub-band groups. If the CRC is violated a concealment based on non- 
reliable scale factors is requested. 

Reliability information . If conventional or punctured convolution codes are used for the 
error protection, additional reliability information could be derived by a Viterbi channel 
decoder. Reliability information derived by the channel decoder will give more information 
on the actually violated data and would support the adaptation of concealment to the error 
situation. 

Failure characteristics of the DAB system have also been tested by subjective 
assessment, see for instance (Lever, 1997). 



3. 7.2 Concealment Measures 

Error correction and error detection strategies can be optimised concerning both the 
encoded audio signal and the listener’s perception. Although some measures for an error 
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correction may have been applied, the channel coding technique has to provide provisions 
for the detection of residual bit errors, because every channel code can be overloaded. 
Several error concealment strategies in combination with low bit rate coding, like muting, 
substitution, repetition or estimation of the disturbed parts of the received encoded audio 
data, have already been proposed and described in (Dehery, 1991). Those techniques 
improve the audio quality in case of transmission errors, but some of them, for example 
left/right-substitution, are not compatible with the audio bit-stream, and it cannot be 
expected that those techniques will be applied. 

Applying error concealment techniques should result in the following improvements for 
the listener: 

• Improved sound quality at the edge of the coverage area (single and burst errors) 

• Improved sound quality during bad mobile reception conditions (single and burst errors) 

• Improved sound quality during cell loss in ATM networks (burst errors) 

• Performance and costs trade-offs between simple and cheap receivers and sophisticated 
and expensive receivers. 

None of these benefits are available to the user in the old analogue system. 



3. 7.3 Frame-oriented Error Concealment 

A complete frame or even some successive frames have to be processed with frame- 
oriented error concealment if the frame CRC indicates an error. Depending on the 
receiver’s capabilities several concealment techniques can be applied. 

Muting . The simplest method of frame-related error concealment is to mute frames which 
are indicated as non-reliable or even non-decodable. Compared to a very annoying 
reception the subjective annoyance caused by occasional muting of these frames might be 
assessed as lower. 

Repetition . Instead of muting, non-decodable frames can be replaced by previously correct 
decoded frames. The audio quality of a repetition depends strongly on the signal itself; 
mostly it is much better compared to a simple muting. Contrary to muting, the repetition of 
frames needs memory of at least one frame (768 bytes at a bit rate of 256 kbit/s). A longer 
break of some frames can be bridged by either multiple repetition of one frame or, 
depending on the memory provided in a receiver, by a single repetition of the necessary 
number of previously correct decoded frames. 



3. 7.4 Error Concealment of Scale Factors 

Scale factors represent the maximum absolute value of a sequence of samples in each sub- 
band. Transmission errors of scale factors do not lead to a distortion of one complete frame, 
but can be compared to a change produced by an equaliser in a way where one or more sub- 
bands are amplified or attenuated by the disturbed scale factor. If MSBs of scale factors are 
not decoded correctly, “beeping”-type impairments are produced and perceived as “very 
annoying”. 
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The probability density function of differences of successive scale factors shows that for 
90% of the time there are only small changes in the range of plus or minus one scale factor 
class. It is very difficult to detect such an error, which is limited to one sub-band and to an 
amplification or attenuation of about 2 dB. 

Based on these statistical data of scale factors, a very simple error concealment can be 
applied. The disturbed scale factors are replaced by previously correct decoded scale 
factors. This can be considered as a concealment technique with no or only small 
impairments of quality. 



3. 7.5 Error Concealment by Plausibility Checks 

The statistical properties of different information within one audio frame consisting of a 
DAB header, BAL, SCFSI, scale factors and sub-band data can be exploited for error 
detection. 

An analysis of the successive scale factors of each sub-band from the point of view of 
statistical properties of audio signals can give additional and more detailed information on 
the reliability of the received scale factors. During an attack of the audio signal (e.g. a piano 
chord), scale factors may change greatly within a short time and may be followed by scale 
factor values which decrease slowly. During the decay of an audio signal continuously 
decreasing values of scale factors can be expected. This means that a high increase 
followed by a high decrease is not plausible. Thus, the scale factor has to be indicated as 
non-reliable and has to be concealed by, for example, repetition or interpolation. Compared 
to other possibilities of error protection and detection codes for the scale factors, this type 
of plausibility analysis does not need additional channel capacity and thus may lead to a 
similar evaluation of the error situation and performance. 

In addition the received scale factor values can be compared to the range of values 
expected by the SCFSI. If the SCFSI indicates the transmission of three scale factors within 
one sub-band, a certain difference (depending on the encoder algorithm) between the 
successive scale factors can be assumed. A more detailed investigation of these 
interdependencies might lead to a basis for a predictor model comparing received audio 
data with predicted data and classifying the received audio data as more or less reliable. 



3.8 A Typical DAB Ensemble 

The DAB system is based on a multiplex that can assign different bit rates to each 
individual programme and that also allows dynamic reconfiguration. The latter could be 
used to split one high-quality programme into two of lower quality or vice versa. 

The actual protection ratio and bit-rate for each individual programme depend on the 
codec used, the material for that programme and the intended audience. Although it would 
be desirable from a quality point of view always to use high bit rates, this is not 
economical. More programmes at acceptable quality are a better deal than fewer 
programmes at guaranteed CD quality. 

Table 3.1 below provides possible values taken from (BPN 007). Table 3.2 lists the type 
of programme with recommended bit rates and channel configurations for certain quality 
levels. We can see a fairly wide range of bit rates for all different quality levels. The coding 
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mode (stereo or Joint Stereo) is not included in this table, but experience shows that the use 
of Joint Stereo coding can reduce the required bit rate. Again the existing codec 
implementations differ significantly, and the actual audio quality therefore does so too. 



Table 3.2 Sets of suitable parameters for different materials and quality levels of audio services 


Programme 


Format 


Quality Level 


Sampling Rate 


Protection 


Bit Rate 


Type 


* 




kHz 


Level 


kbit/s 


Music/Speech 


1/0 


Broadcast quality 


48 


UEP_2 or 3 


112 to 
160 




2/0 


Broadcast quality 


48 


UEP_2 or 3 


128 to 
224 




3/2 


Broadcast quality 


48 


UEP2 or 3 


384 to 
640 


Speech only 


1/0 


Acceptable quality 


24 or 48 


UEP_3 


64 to 












112 


Messages 


1/0 


Intelligible message 


24 or 48 


UEP4 


32 or 












64 


Data 


Non 






UEP4 


32 to 












64 


* Format codes: 


l/o 


mono 










2/0 


two-channel stereo 










3/2 


multichannel 









An actual ensemble, as broadcast in Bavaria, Germany at the time of writing of this 
book, is summarised in Table 3.3. The bit rates reflect the development that has taken place 
over the years in the encoders used. They are slightly lower than in (BPN 007). Also joint 
coding is employed for all stereo programmes. 

As more and more countries start their DAB services, more information will be come 
available. The figures taken from the Ensemble Bayern can be regarded as fairly typical and 
should be a good starting point for a broadcaster to plan a service. Another actual multiplex 
configuration is also described in Chapter 5, Table 5.1. 



Table 3.3 Actual values for Ensemble “Bayern” 



Sub-channel 


Bit Rate 
kbit/s 


Audio Mode 


Protection 

Level 


Service Component 


2 


160 


Joint Stereo 


UEP 3 


Bayern 2 Radio 


3 


160 


Joint Stereo 


UEP 3 


Bayern 3 


4 


192 


Joint Stereo 


UEP 3 


Bayern 4 Klassik 


5 


96 


Single Channel 


UEP 3 


Bayern 5 Aktuell 


1 


160 


Joint Stereo 


UEP 3 


Bayern Mobil 


7 


192 


Joint Stereo 


UEP 3 


Rock Antenne 


6 


160 


Joint Stereo 


UEP 3 


Radio Galaxy 


11 






UEP 4 


MobilData BDR 


11 






UEP 4 


Update 
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Data Services and Applications 
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4.1 General 

While DAB development started mainly focused on audio broadcast, data capabilities have 
been designed into the system right from the beginning. After successful presentations of 
the new digital audio broadcast system at the beginning of the 1 990s, these data capabilities 
gained more and more interest and prototype data services were demonstrated by different 
project partners of Eureka 147 the related events (e.g. at NAB in Las Vegas, Spring 1992). 
In order to harmonise these prototype data services and to reach compatibility between the 
equipment and services, data interfaces, protocols and applications were defined to match 
the needs of broadcasters, service providers and end users. In the meantime it is generally 
accepted that the usage of some DAB channel capacity, especially for multimedia data 
services, is crucial both for user acceptance and for the financing of the networks of this 
new digital broadcast medium. All the necessary hooks are available to follow this route. 

4.2 Multimedia Applications with MOT 

4.2. 1 The Multimedia Object Transfer Protocol 

4.2. LI General Principles 

The “Multimedia Object Transfer” (MOT) Protocol (EN 301234) is a transport protocol for 
the transmission of multimedia content in DAB data channels to various receiver types with 
multimedia capabilities. It is the common transport mechanism for transmitting information 
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